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(57) Abstract 

The present invention is directed to nucleic acids encoding glycosyltransfereses, die proteins encoded thereby, and to methods for 
syndiesizing oligosaccharides using d^ glycosyltransfeiases of the invention. In a specific embodiment, a glycosyltransfeiase locus of ^^Ir- 
seria gonorrhoeae containing five open reading frames for five different glycosyltransferases. The fimctioi^ly active glycosyltransf erases 
of die invention are diaracterized by catalyzing reactions such as adding Gal 01-^4 to GicNAc or Glc, adding GalNAc or GlcNAc /9 1-^3 
to (jal; and adding Gal a 1-^4 to Gal. The glycosyltransfeiases of die invention are paiticularly suited to die syntheses of the oligosaccha- 
rides Ga^l-t4GIcNAc^l-^3Ga|/9l-»4Glc (a mimic of lacto-N-neotetiaose). GaINAc01-»3(jaV9l-»4GlcNAq9 W3Ga101-»4Glc/^l-»4 
(a mimic gangliosideX and Galal-»4GaV'l-»4Glc^l-»4Hep-»R (a mimic of die saccharide portion of globoglycolipids). 
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GLYCOSYLTRANSFERASES FOR BIOSYNTHESIS OF 
OLIGOSACCHARIDES, AND GENES ENCODING THEM 

The iiesearch leading to the present uivention was supported in part with fiinds 
from grant number AI*10615 from the Public Health Service. Accoidingly. the 
5 Government may have certain ri^ts in the invention. 

FIELD OF THE INVKNTTON 

The present mvention relates to glycosyltransferases useful for biosynthesis of 
oligosaccharides, genes encoding such glycosyltransferases and recombinant 
methods of producing the enqrmes, and the oUgosaccharides produced thereby. 

10 BACKGROUND OF THE INVENTION 

Neisseria and Lipo-oligosaccharide 
While Neisseria species commonly colonize many mammalian hosts, human beings 
are the only species subject to invasive disease by members of this species. 
Neisseria meningitidis is the ctiologic agent for septicemia and meningitis that may 

15 occur in epidemic form. Neisseria gonorrhoeae is the causative agent of 
gononiiea and its manifold conq>Iications. These organisms, particularly the 
gonococcus, have proved remarkably adept at varying the antigenic array of their 
surface-exposed molecules, notably their adhesive pili and opacity-related (opa) 
proteins. The genetic mechanisms for the variation of pilus (Meyer et al., 1982, 

20 Cell 30:45; Haas and Meyer, 1986, Cell 44:107; Koomey et al., 1987, Genetics 
117:391; Swanson and Koomey, 1989, American Society for Microbiology, 
Washington, 743-761) and opa protein (Stem et al., 1986, Cell 47:61; Meyer et 
al., 1990, Ann. Rev. Microbiol. 44:451; Bhat et al., 1991, Molec. Microbiol. 
5:1889) expression are in the main well understood. Like other Gram-negative 

25 bacteria the Neisseria ssp. carry LPS in the external leaflet of their outer 

membranes (Johnston and Gotschlich, 1974, J. Bacteriol. 119;250). In contrast to 
the high molecular weight LPS molecules with repeating 0-chains seen in many 
enteric bacteria, the LPS of Neisseria ssp. is of modest size and therefore is often 
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referred to as lipooligosaccharide or LOS. Although the molecular size of the 
LOS is similar to that seen in rough LPS mutants of Salmonella ssp., this 
substance has considerable antigenic diversity. In the case of the meningococcus, 
a serological typing scheme has been developed that separates strains into 12 
5 immunotypes (Zollmger and Mandrell, 1977, Infect. Inunun. 18:424; Zollinger 
and Mandrell, 1980, Infect. Immun. 28:451). A remarkably complete 
understanding of the structure of meningococcal LPS (recently reviewed (Verbeul 
et al., 1993, Microbiol. Rev. 57:34) has resulted from the studies of Jemiings and 
his colleagues (Jennings et al., 1983, Carbohyd. Res, 121:233; Michon ct al., 

10 1990, J. Biol. Chem. 265:7243; Gamian et al., 1992, J. BioL Chem. 267:922; 
Pavliak et al., 1993, J. Biol. Chem. 268:14146). In the case of Neisseria 
gonorrhoeae, antigenic variability is so pronounced that a serological classification 
scheme has proved elusive. In part this is due to the heterogeneity of LOS 
synthesized by a particular strain; LOS preparations frequently contain several 

15 closely spaced bands by SDS-PAGE (Mandrell et al., 1986, Infect. Immun. 

54:63). Further, studies using monoclonal antibodies indicate, that gonococci are 
able to change the serological characteristics of the LOS they express and that this 
antigenic variation occurs at a firequency of 10"^ to 10 ^, indicating that some 
gei^tic mechanism must exist to achieve these high frequency variations 

20 (Schneider et al., 1988, Infect. Immun. 56:942; Apicella et al., 1987, Infect. 

Immun. 55:1755). Because of the molecular heterogeneity and antigenic variation 
of the LOS produced by gonococci the determination of the structural chemistry of 
this antigen has proved to be a difficult problem, and definitive information based 
on very sophisticated analyses has only recently become available (Yamasaki et al, 

25 1991, Biochemistry 30:10566; Kerwood et al., 1992, Biochemistry 31:12760; John 
et al., 1991, J. Biol. Chem. 266:19303; Gibson et al., 1993, J. Bacteriol. 
175:2702). These are summarized in Figure 1. Of particular interest is the 
presence of the tetrasaccharide Gal01-i4GIcNAc/Jl-»3Galj31-^4Glc^l-^, which is 
a perfect mimic of lacto-N-neotetraose of the sphingolipid paragloboside (Mandrell 

30 et al., 1988, J. Exp. Med. 168:107; Tsai and Civin, 1991, Infect. Immun. 
59:3604). In LOS this tetrasaccharide frequently bears an additional N-acetyl 
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gaJactosamine residue (GaINAc/Sl-3Gal/Jl-^lcNAc/51-3Gal/Jl-MGIc/Sl-^), and 
tben mimics gangliosides. In some strains of gonococci an alternative side chain is 
found which has the structure GaIal-»4Galj81-^Glc^l-^Hep-R (John et al.. 
1991, J. Biol. Chem. 266:19303). This is a mimic of the saccharide portion of 
5 globo-glycoiipids (Mandrel!. 1992. Infect. Immun. 60:3017). and is the stnicture 
characteristically found in Neisseria meningitidis immunotype LI, 

The LOS molecules have a number of biological activities. They are potent 
endotoxic molecules believed to be die toxin re^xmsible for adrenal cortical 
necrosis seen m severe nmingococcal disease. They save as die target antigen 
10 for much of die bactericidal activity present in normal or convalescem human sera 
(Rice et al., 1980, J. Immunol. 124:2105). Gonococci possess a very unusual 
sialyl transferase activity which is able to use externally supplied CMP-NANA and 

add N-acetyl neuraminic add to die LOS on die surface of die organism (Nairn et 
al., 1988. L Gen. Microbiol. 134:3295; Parsons et al.. 1989, Microb. Padiog. 

15 7:63; MandreU et al., 1990. J. Exp. Med. 171:1649). Group B and C 
meningococci, have the c^jacity to syndiesize CMP-NANA. and frequently 
sialylaie dieir LOS widiout requiring exogenous CMP-NANA (Mandrell et al., 
1991, J. Bacteriol. 173:2823). In Neisseria meningitidis strain 6275 inununotype 
L3, die sialic acid unit is linked a2— 3 to die terminal Gal residue of die lacto-N- 

20 neotetraose (Yamasaki et al., 1993, J. Bacteriol. 175:4565). The levels of CMP- 
NANA found in various host environments is sufficient to support this reaction 
(Apicella et al.. 1990, J, Infect. Dis. 162:506). The sialylation of die LOS causes 
gonococci to become resistam to die antibody-conqilement dependent bactericidal 
effea of serum (Parsons et al., 1989, Microb. Padiog. 7:63). The resistance is 

25 not only to die bactericidal effect mediated by antibodies to LOS. but to odm- 
surfiace antigens as well (Wetzler et al., 1992, Infect. Immun. 60:39). van Putten 
has demonstrated diat exposure of gonococci to CMP-NANA maricedly reduces 
their ability to invade epidielial cells in tissue culture (Van Putten, 1993, EMBO J. 
12:4043). These fmdings strongly suggest diat die ability of gonococci to vary die 
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chemical nature of the LOS provides them with the ability to cope with different 
host environments (Maodrell and Apicella, 1993, ImmwK>biology 187:382). 

Pertiaps most telling, it has been found that LOS variation is selected in vivo in 
infections of human beings. A well characterized gonococcal laboratory strain 
5 MSll^ variant A was used to inoculate vohinieers (Swanson et al., 1988. J. Exp, 
Med. 168:2121). In the two infected individuals over a period of 4 to 6 days the 
pqmlation of gonococci recovered in their urine increasingly shifted to two 
variants that expressed antigenically different LOS (Schneider et al., 1991, J. Exp. 
Med. 174:1601). A structural analysis revealed that the inoculated variant A 
10 produced a truncated LOS containii^ only the i3-Iactosyl group linked to Hepl, 
while one of the new variants (variant C) produced a conq)lete LOS (Kerwood et 
al., 1992, Biochemistry 31:12760). This suggests that the addition of the 
additional sugars GalNAc/31-*3GaI^lr»4GlcNAc/31-*3 is likely to be under control 
of a phase variation mechanism. 

IS Little information on the genetics of LOS synthesis in Neisseria is available. A 
major advance has been the creation (Dudas and Apicella, 1988, Infect. Immun. 
56:499) and bioctemical characterization (John et al., 1991, J. Biol. Chem. 
266:19303) of five pyocin mutants of gonococcal strain 1291, dubbed 1291a-e. 
Immunological and biochemical data have shown that 1291a, 1291c, 1291d and 

20 1291e produce LOS with sequential shortening of the lacto-N-neotetraose chain* 
with mutant 1291e lacking the glucose substitution on the heptose. Mutant 1291b 
synthesizes the alternative LOS structure Galcrl-»4Galj9l-»4Glc (see Figure 1). 
Only the genetic basis of the 1291e mutant is now defined. It is a mutation of 
phosphoglucomutase (pg/n), which precludes the synthesis of UDP-glucose, and 

25 hence the addition of the first residue of the lacto-N-neotetraose unit (Zhou et al., 
1994, J. Biol. Chem. 269:11162; Sandlin and Stein, 1994, J. Bacteriol. 
176:2930). It also has been shown that galE mutants of meningococcus or 
gonococcus produce tniiK:ated LOS in keeping with the inability to synthesize 
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UDP-galactose (Robertson et al., 1993, Molec. Microbiol. 8:891; Jennings et al., 
1993, Molec. Microbiol. 10:361). 

Biosynthesis of Oliff nsac charides 
Oligosaccharides are polymers of varying number of residues, linkages, and 
5 subunits. The basic subunit is a carbohydrate monosaccharide or sugar, such as 
mannose, glucose, galactose, N-acetylglucosamine, N*acetyIgalactosamine, and die 
like. The number of different possible stereoisomeric oligosaccharide chains is 
enormous. 

Oligosaccharides and polysaccharides play an inq)ortant role in protein function 
10 and activity, by serving as half-life modulators, and, in some instances, by 
providing structure. As pointed out above, oligosaccharides are critical to the 
antigenic variability, and hence inmiune evasion, of Neisseria^ especially 
gonococcus. 

Numerous classical techniques for the synthesis of carbohydrates have been 
15 developed, but these techniques suffer the difficulty of requiring selective 
protection and deprotection. Organic synthesis of oligosaccharides is further 
hampered by the lability of may glycosidic bonds, difficulties in achieving regio- 
selective sugar coupling, and generally low synthetic yields. In short, unlike the 
experience with peptide synthesis, traditional synthetic organic chemistry cannot 
20 provide for quantitative, reliable synthesis of even fairly simple oligosaccharides. 

Recent advances in oligosaccharide synthesis have occurred with the isolation of 
glycosyltransferases. These em^mes can be used in vitro to prepare 
oligosaccharides and polysaccharides (see. e.g.. Roth, U.S. Patent No. 5.180,674, 
issued January 19, 1993). The advantage of biosynthesis with glycosyltransferases 
25 is that the glycosidic linkages formed by enzymes are highly stereo and regio* 
specific. However, each enzyme catalyzes linkage of specific sugar residues to 
other specific acceptor molecules, e.g. , an oligosaccharide or lipid. Thus, 
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syntfiesis of a desired oligosaccharide may be limited by the availability of 
glycosyltiansferases (see. Roth, International Patent Publication No. WO 
93/13198, published July 8, 1993). 

Another drawback of biosynthesis is that the glycosyltransferases themselves are 
5 usually present in fairiy low quantities in cells. It is difficult to obtain enough of 
the enzyme to be commercially practicable. 

Thus, there is a great need in the art for glycosyltransferases. There is a further 
need for genes encoding such glycosyltransferases, to provide an unlimited source 
of glycosyltransferases through recombinant technology. 

10 The citation of any reference herein should not be construed as an admission that 
such reference is available as prior art to the instant invention. 

SUMMARY OF THE INVENTION 

The present invention is directed to nucleic acids encoding glycosyltransferases, 
the proteins encoded thereby, and to methods for synthesizing oligosaccharides 

15 using the glycosyltransferases of the invention. Accordingly, in one aspect, the 
invention is directed to a purified nucleic acid that is hybridizable under 
moderately stringent conditions to a nucleic acid corresponding to the LOS locus 
of Neisseria, e.g. , a nucleic acid having a nucleotide sequence corresponding to or 
complementary to the nucleotide sequence shown in Figure 2 (SEQ ID NO:l). 

20 Preferably, the nucleic acid of the invention is hybridizable to a portion of the 
codii^ sequence for a gene of the LOS locus, /.e. , a portion of the nucleotide 
sequence shown in Figure 2 (SEQ ID N0:1) that encodes a functionally active 
glyco^ltransferase. 

In specific embodiments, the invention relates to a nucleic acid that has a 
25 nucleotide sequence corre^nding to or complementary to a portion of the 
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nucleotide sequence shown in Figure 2 (SEQ ID NO: 1) that encodes a functionally 
active glycosyltransferase. In a further aspect, the nucleic acid encodes a 
functionally active glycosyltransferase. In a specific embodiment, the invention is 
directed to a nucleic acid that has a nucleotide sequence correspondmg to or 
5 complementary to the nucleotide sequence shown in Figure 2 (SEQ ID NO:l). 

The functionally active glycosyltransferases of the invention are characterized by 
catalyzing a reaction selected from the group consisting of: 

adding Gal ffl-M to GlcNAc or .(He; 

adding (SalNAc or GIcNAc /31-»3 to Gral; and 
10 adding (lal al-»4 to Gal. 

Most preferably, the claimed nucleic acid encodes a functionally active 
glycosyltransferase. However, nucleic acids of the invention include 
oligonucleotides useful as primers for polymerase chain reaction (PCR) or for 
probes for the presence and level of transcription of a glycosyltransferase gene. 

15 In specific embodiments, exemplified herein, the nucleic acid encodes a 

glycosyltransferase having an amino acid sequence of SEQ ID NO:3, SEQ ID 
NO:4. SEQ ID NO:5. SEQ ID NO:6, or SEQ ID NO:8. 

The invention further relates to an expression vector comprisii^ the nucleic acid 
encoding a glycosyloransferiase of the invention operatively associated with an 
20 expression control sequence. Accordingly, the invention extends to recombinant 
host cell transformed with such an expression vector. 

In another aspect, the invention is directed to a method for producing a 
glycosyltransferase comprising culturing the recombinant host cell under conditions 
that allow expression of the glycosyltransferase; and recovering the expressed 
25 glycosyltransferase. 
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In a primary aspect, the invention is directed to glycosyltransferase having an 
amino acid sequence of SEQ ID N0:3, SEQ ID N0:4, SEQ ID NO:5, SEQ ID 
N0:6, or SEQ ID N0:8, a functionally active fragment thereof. The invention 
further comenq>lates a conq^osition conq)nsing a glycosyltransferase conjugated to 
5 a solid phase support, wherem the glycosyltransferase is selected from the group 
consisting of a glycoqrltransferase having an amino acid sequence of SEQ ID 
N0:3, or a functionally active fragment thereof; a glycosyltransferase having an 
amino acid sequence of SEQ ID NO:8, or a functionally active fragment thereof; a 
glycosyltransferase having an amino acid sequence of SEQ ID NO:4, or a 
10 functionally active fragment thereof; and a glycosyltransferase havir^g an amino 
acid sequence of SEQ ID NO:5, or a functionally active fragment thereof; and a 
glycosyltransferase having an amino acid sequence of SEQ ID NO:6, or a 
functionally active fragment thmof . 

Having provided novel glycosyltransferases, and genes encoding the same, the 

15 invention accordingly further provides methods for preparing oligosaccharides, 
e.g., two or more saccharides. In specific embodiments, the invention relates to a 
method for adding GalNAc or GlcNAc /31— 3 to Gal, comprising contacting a 
reaction mixnire comprisii^ an activated GalNAc or GlcNAc to an acceptor 
moiety comprising a Gal residue in the presence of the glycosyltransferase having 

20 an amino acid sequence of SEQ ID N0:3; a method for adding Gal /81-4 to 

GlcNAc or Glc. conq)rising contacting a reaction mixture comprising an activated 
Gal to an acceptor moiety comprising a GlcNAc or Glc residue in tte presence of 
the glycosyltransferase having an amino acid sequence of SEQ ID NO: 8; a method 
for adding Gal al-*4 to Gal. comprisiqg contacting a reaction mixture comprising 

25 an activated Gal to an acceptor moiety comprising a Gal residue in the presence of 
the glycosyltransferase having an amino acid sequence of SEQ ID N0:4; a method 
for adding GalNAc or GlcNAc 01^*3 to Gal, comprising contacting a reaction 
mixture comprising an activated GalNAc or GlcNAc to an acceptor moiety 
comprising a Gal residue in the presence of the glycosyltransferase having an 

30 amino acid sequence of SEQ ID N0:5; and a method for adding Gal 01^*4 to 
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GlcNAc or Glc, comprising contacting a reaction mixture comprising an activated 
Gal to an acceptor moiety comprising a GlcNAc or Glc residue in the presence of 
the glycosyltransferase having an amino acid sequence of SEQ ID N0:6. 

In a preferred embodiment, the oligosaccharides are prepaid on a carrier that is 
5 non-toxic to a mammal, in particular a human, such as a lipid isoprenoid or 
polyisoprenoid alcohol. A specific exaixq>le of such a carrier is dolichol 
phosphate. In a ^ific embodiment, the oligosaccharide is attached to the carrier 
via a labile bond, thus allowing for chemically, removing the oligosaccharide from 
the lipid carrier. Alternatively, an oligosaccharide transferase can be used, e.g. . 
10 to transfer the oligosaccharide fitnn a lipid carrier to a protem. In yet another 
embodiment, the glycosyltransfcrases can be expressed in a eukaryotic expression 
system, to provide for glycosylation of a protein expressed m such a system. 

An important advantage of the present invention is that it provides for the 
synthesis of oligosaccharide antigens of Neisseria independently of lipid A, which 

15 is highly toxic. Use of the natural LOS from Neisseria, while theoretically 
desirable for vaccine preparation, fails. The lipid A portion of LOS is a potent 
endotoxin, and highly toxic. Chemical treatment of the LOS, e.g,, by hydrolysis, 
destroys the antigenicity of the oligosaccharide, leaving a useless product. Thus, 
it is highly desirable to have a source of Neisseria oligosaccharides attached to 

20 non-toxic lipids for vaccine preparation. 

Thus, the invention provides glycosyltransferases and strategies for preparing a 
number of oligosaccharides, such as but not Innited to, Galal-*4Gal/31'-«4Glc, 
Gal/31-»4G]cNAcj31-»3Galj31-N|GIc, and 
GalNAc^l-3Gali81-wKjlcNAc01-^3Gal/91-MGlc, 

25 Accordingly, it is a primary object of the invention to provide glycosyltransferases 
useful for the synthesis of oligosaccharides. 
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It is a further object of the invention to provide for the synthesis of 
oligosaccharides characteristic of Neisseria meningitidis and N. gonorrhoeae. 

It is a further object of the invention to provide for the synthesis of 
oligosaccharides characteristic of mammalian oligosaccharides, including blood 
5 group core oligosaccharides. 

It is still a further object of the mvention to provide for vaccines having the 
oligosaccharide unit of LOS, but lacking lipid. A. 

Still a further object of the invention is to provide for synthesis of therapeutically 
useful oligosaccharides. 

10 These and other objects of the present will be made clear by reference to the 
following Drawings and Detailed Description. 

BRIEF DESCRIPTION OF THE DRAAVINGS 

Figure 1: Alternative structures found in gonococcal LOS. Rl refers to the inner 
core region of LOS consisting of two keto-deoxy*octulosonic acid (KDO) residues. 

15 These in turn are attached to a lipid A structure. R2 in gonococci is typically 
GlcNAc/31-»2Hepal-*3. The structure in the top panel contains a tetrasaccharide 
identical to lacto-N-neotetraose found in paragloboside glycolipids. In many 
stirains this tetrasaccharide bears a terminal GalNAc01-»3. The tower panel shows 
an alternative trisaccharide structure with the terminal Gal al-M linked. This 

20 trisaccharide is seen in meningococci of the LI serotype and in some gonococcal 
strains. The portions of the two structures recognized by the monoclonal 
antibodies used in this study are indicated (4C4) (Dudas and Apicella, 1988, 
Infect. Immun. 56:499) 3F11 (Mandrell et al., 1988, J. Exp. Med. 168-107; 
Yamasaki et al.. 1991, Mol. Immunol. 28:1233) 1-1-M (Yamasaki et al., 1991, 

25 Mol. Immunol. 28:1233), 2-1-L8 (Kerwood et al., 1992, Biochemistry 31:12760; 
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Schneider et al., 1991, J. Exp. Med. 174:1601; Schneider et al., 1985. Infect. 
Inunun. 50:672) 9-2-L378 and 17-1-Ll. 

Figure 2: (A) Genetic map of the LOS locus based on the DNA sequence. 
Sequence information bp 1-2725 was obtained from plasmid pPstCla, 1^ 2725- 
5 5859 from plasmid p3400 (see materials and methods). IS refers to an area of the 
sequence that has homology to a previously rqxmed neisseria! insertion sequence 
IS1106 (Knight et al.. 1992. Moiec. Microbiol. 6:1565). The positions of the 
reading frames of IffA-E are indicated. Three tracts of poly-G were found in IgtA 
(17 bp). IgtC (10 bp) and IgtD (11 bp) and are indicated by vertical black bars. 
10 Amino acid sequences of (B) LgtA (SEQ ID N0:3), (C) LgtB (SEQ ID NO:8), 
(D) LgiC (SEQ ID N0:4). (E) LgtD (SEQ ID NO:5), and (F) LgtE (SEQ ID 
NO:6), and (G-M) the nucleotide sequence of the Igt locus (SEQ ID NO:l). 

Figure 3(A,B): Homology of the protein products of IgtA and IgtD. The primary 
structure of two proteins is very similar, particularly in die first half of the 
15 sequences. The glycine residues starting at position 86 reflect die coding of the 
poly-G regions in the respective genes. The Bestflt program of the GCG package 
was used and the ^bols | . :. . represent degrees of similariQ' based on the 
Dayhoff PAM-250 matrix. 

Figure 4(A.B): Homology of the protein products of IgtB and IgtE. The primary 
20 structure of two proteins is very similar, particularly in the first half of the 

sequences. These sequences also have significant homology to lex-1 (Cope et al., 
1991. Molec. Microbiol. 5:1113) or Uc2A (High et al.. 1993. Motec. Microbiol. 
9: 1275) genes of HamopfUtus irtftuemae. For meaning of symbols see Figure 3. 

Figure 5(A,B): Homology of die protein products of r/h/ and IgtC. The E. coli 
25 ifal and rfaJ g«ies are very closely related. They serve as glucosyl transferases 
of two gfaicose residues in die LPS core region (Pradel et al., 1992. J. Bacteriol. 
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174:4736). The glycines at position 54-56 in IgtC are encoded by the poly-G 
tract. For meaning of symbols see Figure 3. 

Figure 6: I>eletions in the LOS locus. Three insertion and five deletions of the 
LOS locus were constnicted as detailed in the methods section. The restriction 
5 sites that were used are indicated. The insertions are marked by triangles and the 
extent of the deletions by stippled boxes. The open arrows indicate tte opon 
reading frames disnipted by the construction. In each of the constructs the 
erythromycin marter ermC was ins^ted at the site of die insertion or the 
deletion. 

10 Figure 7: Silver-stained SDS-PAGE of LOS preparations. Gel electrq>horesis of 
purified LOS samples of 375 ng was performed and stained as described in 
materials and methods. Above the gel are indicated the structure of the LOS of 
the major bands inferred to be present in each of the preparations. These 
structures are based on the reactivity with monoclonal antibodies shown in Figure 

15 8» but arc presented in this Figure to facilitate interpretation of the patterns 
observed. R stands for the inner core region and lipid A. 1291e is a pyocin 
resistant mutant (Ehidas and Apicella, 1988, Infect. Immun. 56:499) 

Figure 8: Reactivity of LOS from strain F62 wt and mutants with monoclonal 
antibodies. The names of the following monoclonal antibodies were abbreviated: 
20 17-1-Ll (LI), 9-2-L378 (U), 2-1-L8 (L8). Purified LOS was applied to 

Immobilon-P membranes, allowed to react with the antibodies and developed as 
described in materials and methods. The specificity of the monoclonal antibodies 
is summarized in Figure 1. 

DFTATLFn DFiSr RIPTION OF THE INVENTION 

25 As disclosed above, the present invention provides five novel glycosyltransferases, 
genes encoding the glycosyltransferases, and methods for biosynthesis of 
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Oligosaccharides using such glycosyl transferases. The glycosyl transferases of the 
invention can be used for in vitro biosynthesis of various oligosaccharides, such as 
the core oligosaccharide of the human blood group antigens, i.e., lacto-N- 
neotetraose. 

5 Cloning and expression of glycosyltransferases of the invention can be 
accomplished usmg standard techniques, as disclosed herein. Such glycosyl 
transferases are useful for biosynthesis of oligosaccharides in vitro, or alternatively 
genes encoding such glycosyltransferases can be transfected into cells, e.g. , yeast 
cells or rakaryotic cells, to provide for alternative glycosylation of proteins and 
10 lipids. 

The instant invention is based, in part, on the discoveiy and cloning of a locus 
involved in the biosynthesis of gonococcal LOS has from gonococcal strain F62. 
The locus contains five open reading frames. The first and the second reading 
frames are homologous, but not identical to the fourth and the fifth reading frames 

15 respectively. Interposed is an additional reading frame which has distant 

homology to the E. coii rfal and rfaJ genes, both glucosyl transferases involved in 
LPS core biosynthesis. The second and the fifth reading frames show strong 
homology to the lex-l or lic2A gene of Haemophilus influenzae, but do not 
contain the CAAT repeats found in this gene. Deletions of each of these five 

20 genes, of combinations of genes, and of the entire locus were constructed and 
mtroduced imo parental gonococcal snrain F62 by transformation. The LOS 
pheno^T^ were then analyzed by SDS-PAGE and reactivity with monoclonal 
antibodies. Analysis of the gonococcal mutants indicates that four of these genes 
are the glycosyl transferases that add GalNAci91->»3Gali31-»4GlcNAc/?l-<>3Gali31-<4 

25 to the substrate Glc/31-i4Hep-«^R of the inner core region. The gene with 
homology to E. coli ffal/rfaJ is involved with the addition of the o-linked 
galactose residue in the biosynthesis of the alternative LOS strucmre 
Galal-*4Gal/31-^GIci91-*4Hep-4l. 
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Since ttese genes encode LOS glycosyl transferases they have been named IgtA, 
IgtB, IgtQ IgtD and IgtE. The DNA sequence analysis revealed that IgtA, IgtC 
and IgtD contain poly-G tracts, which in strain F62 were respectively 17, 10 and 
1 1 bp. Thus, three of the LOS biosynthetic enzymes are potentially susceptible to 
5 premature termination by reading-frame changes. It is likely that these structural 
features are responsible for the high frequency genetic variation of gonococcal 
LOS. 

Abbreviations used tfiroughout this speciflcation mchide: Lipppolysaccharide, LPS; 
Ltpooligosaccharide, LOS; N-Acetyl-neuraminic acid cytidine mono phosphate, 
10 CMP-NANA; wild type, wt; Gal, galactose; Glc, glucose; NAc, N-acetyl (e.g., 
GalNAc or GlcNAc). 

In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the 
skill of the art. Such techniques are explained fully in the literature. See, e.g., 

15 Sambrook, Fritsch & Maniatis, "Molecular Cloning: A Laboratory Manual,** 

Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
New York (herein "Sambrook et al., 1989"); **DNA Cloning: A Practical 
Approach," Volumes I and II (D.N. Glover ed. 1985); "Oligonucleotide 
Synthesis" (M.J. Gait ed. 1984); "Nucleic Acid Hybridization" [B.D. Hames & 

20 S.J. Higgins eds. (1985)]; "Transcription And Translation** [B.D. Hames & S.J. 
Higgins, eds. (1984)]; "Animal Cell Culture" [R.I. Freshney, ed. (1986)1; 
"Immobilized Cells And Enzymes" [IRL Press, (1986)]; B. Perbal, "A Practical 
Guide To Molecuhu- Cloning" (1984). 

Therefore, if appearing herein, the following tenns shall have the definitions set 
25 out below. 

A cell has been "transformed** by exogenous or heterologous DNA when such 
DNA has been introduced inside the cell; the cell may express a gene or genes 
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encoded by such DNA. The transfonning DNA may or may not be integrated 
(covalently linked) fauo chromosomal DNA making up the genome of the cell, or 
may be contained on an autonomous rq)Iicon. In prokaryotes, yeast, and 
m a mm a li a n cells for example, the transforming DNA may be maintainpti on an 
5 episomal elemem such as a plasmid. A "clone" is a population of cells derived 
firtan a single cell or conunon ancestor by mitosis. 

A "nucleic acid molecule" refers to the phosphate ester polymeric form of 
ribonucleosides (adenosme. guanosnie. uridine.or cytidine; "RNA molecutes") or 
deoxyribonucleosides (deoxyadenosioe. deoxyguanosine. deoxythymidme, or 

10 deoxycytidine; "DNA molecules") in either siqgle stranded form, or a double- 
stranded helix. Double stranded DNA-DNA, DNA-RNA and RNA-RNA helices 
are possible. The term mideic acid molecule, and m particular DNA or RNA 
molecule, refers only to tbt primary and secondary strucnire of the molecule, and 
does not limit it to any particular tertiary forms. Thus, this term includes double- 

15 stranded DNA found, inter alia, in linear or circular DNA molecules (e.g. , 
restriction fragments), viruses, plasmids, and chromosomes. In discussing the 
structure of particular double-stranded DNA molecules, sequences may be 
described herein according to the normal convention of giving only the sequence in 
the 5' to 3' direction along the nontranscribed strand of DNA (i.e.. the strand 
20 having a sequence homologous to the mRNA). A "recombinant DNA molecule- 
is a DNA molecule that has undergone a molecular biological manipulation. 

A nucleic acid molecule is "hybridizable" to anodier nucleic acid molecule, such 
as a cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic 
acid molecule can anneal to the other nucleic acid molecule under the appropriate 
25 conditions of temperature and sohition ionic strength (see Sambrook et al. , 1989, 
supra). The coi^itions of tempoature and ionic strength determine the 
"stringenqr" of the hybridization. Hybridization requires that the two nucleic 
acids contain complementary sequences, although depending on the stringency of 
the hybridization, mismatches between bases are possible. The appfx>priate 
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stringeiKry for hybridizing nucleic acids dq)ends on the length of the nucleic acids 
and the degree of complementation, variables well known in the art. The greater 
the degree of similarity or homology between two nucleotide sequences, the 
greater the value of T„ for hybrids of nucleic acids having those sequences. The 
5 relative stability (corresponding to higher TJ of nucleic acid hybridizations 
decreases in the followipg order: RNA:RNA, DNA:RNA, DNAiDNA. For 
hybrids of greater than 100 nucleotides in length, equations for calculating T„ have 
been derived {see Sambrook et al., 5i(pra, 9.S0-9.51). For hybridization with 
shorter nucleic acids, i.e., oligomicleotides, die position of mismatches becomes 
10 more important, and the length of the oligonucleotide d^rmines its specificity 
{see Sambrook et al., supra, 11.7-11.8). Preferably a minimum length for a 
bybridizable nucleic acid is at least about 10 nucleotides; more preferably at least 
about IS nucleotides; most preferably die length is at least about 20 nucleotides. 

A DNA "coding sequence" is a double-stranded DNA sequence which is 
IS transcribed and translated into a polypeptide in vivo when placed under the control 
of appropriate regulatory sequences. The boundaries of the coding sequence are 
determined by a start codon at the S' (amino) terminus and a translation stop 
codon at the 3' (carboxyl) terminus. A coding sequence can include, but is not 
limited to, prokaryotic sequences, cDNA from eukaryotic mRNA, genomic DNA 
20 sequences from eukaryotic (e.g., manmialian) DNA, and even synthetic DNA 
sequences. If the coding sequence is intended for expression in a eukaryotic cell, 
a polyadenylation signal and transcription termination sequence will usually be 
located 3' to the coding sequence. 

Transcriptional and translational control sequences are DNA regulatory sequences, 
25 such as promoters, enhancers, terminators, and the like, that provide for the 
expression of a coding sequence in a host cell. Although the individual genes 
encoding glycosyltransferases of the invention are found in a single locus with 
very short non-coding sequences b^een them, phase variation resulting m 
deletion of any of IgtA^ IgtB, or IgtC does not preclude reinitiation of transcription 
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at the downstream genes. Thus, the locus provided terein includes transcription 
initiation sequences for transcription in Neisseria, Alternatively, the coding 
sequences of the invention can be engineered for e>q)ression under control of 
heterologous control sequences. 

5 A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase in a cell and initiating transcTq>tion of a downstream (3' direction) 
coAmg sequrace. For purposes of defining the present invention, the promoter 
sequrace is bounded at its 3' terminus by tte transcription initiation site and 
extends upstream (5' direction) to inchide the mininmm number of bases or 

10 elements necessary to initiate transcription at levels detectable above background. 
Within the promoter sequence will be found a transcription initiation site 
(conveniently defind for exanq)Ie, by mapping with nuclease SI), as well as 
protein binding domains (consensus sequences) responsible for the binding of RNA 
polymerase. Eukaryotic promoters will often, but not always, contain "TATA" 

15 boxes and "CAT" boxes. 

A coding sequence is "imder the control" of transcriptional and translational 
control sequences in a cell when RNA polymerase transcribes the coding sequence 
into mRNA, which is then translated into the protein encoded by the coding 
sequence. 

20 A "signal sequence" can be included before the coding sequence. This sequene 
encodes a signal peptide, N-terminal to the polypeptide, that directs the host cell to 
translocate the polypeptide to the cell surface or to organelles within the cell, or 
secrete the polypeptide into the media, and this signal peptide is usually selectively 
cleaved by the protein transport machinery. Signal sequences can be found 

25 associated with a variety of proteins native to prokaryotes and eukaryotes. 

Incorporation of a signal sequence may be desirable for high level expression of a 
glycosyltransferase of die invention by bacteria, yeast, insect cells (baculo virus), 
or eukaryotic cells, to avoid affecting endogenous glycosyltransfer in the host cell. 



wo 96/10086 



PCr/US9S/12317 



18 

A molecule is "antigenic" when it is capable of ^ecifically interacting with an 
antigen recognition molecule of the inmiiine system, such as an immunoglobulin 
(antibody) or T cell antigen receptor. As mentioned above, the carbohydrate 
(oligosaccharide) moiety of the LOS of Neisseria is an important antigenic 
5 determinant, which determines serotype of meningococcus (Zollinger and 
Mandrell, 1977, Infect. Immun. 18:424; Zollinger and Mandiell, 1980, Infect. 
Immun. 28:451). An antigenic portion of a molecule can be that portion that is 
immunodominant for antibody, or it can be a portion used to generate an antibody 
to the molecule by conjugatii^ the antigenic portion to a carrier molecule for 
10 immun iza t i o n. A molecule that is antigenic need not be itself immunogenic, i.e. , 
capable of eliciting an immune response widiout a carrier. 

A composition comprising "A" (where "A" is a single protein, DNA molecule, 
vector, etc.) is substantially finee of "B** (where "B" comprises one or more 
conta min ating proteins, DNA molecules, vectors, etc.) when at least about 75% by 

15 weight of the proteins, DNA, vectors (depending on the category of species to 
which A ^ B belong) in the conq)ositi6n is "A". Preferably, "A" comprises at 
least about 90% by weight of the A+B species in the conqwsition, most 
preferably at least about 99% by weight. It is also preferred that a composition, 
which is substantially free of contamination, contain only a single molecular 

20 weight species having the activity or characteristic of the species of interest. 

The phrase "pharmaceutically acceptable" refers to molecular entities and 
conq)ositions that are physiologically tolerable and do not typically produce an 
allergic or similar untoward reaction, such as gastric upset, dizziness and the like, 
when administered to a human. Preferably, as used herein, the term 
25 "pharmaceutically acceptable" means approved by a regulatory agency of the 
Federal or a state government or listed in the U.S. Pharmacopeia or other 
generally recognized pharmacqpeb for use in animals, and more particulariy in 
humans. The term "carrier" refers to a diluent, adjuvant, excipient, or vehicle 
with which the compound is administered. Such pharmaceutical carriers can be 
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sterile liquids, such as water and oils, including those of petroleum, animal. 

vegetable or synthetic origin, such as peanut oU, soybean oil, mineral oU, sesame 

oU and the like. Water or aqueous solution saline solutions and aqueous dextrose 

and glycerol solutions are preferably employed as carriers, particularly for 

5 injectable solutions. PharmaceuticaUy acceptable compositions of the invention are 

free of amounts of lipid A effective to cause a response in a mammaUan subject, 
in particular a human subject. 

The term "adjuvant" refers to a compound or mixture that enhances the immune 
re^nse to an antigen. An adjuvam can serve as a tissue dqwt that slowly 
10 releases the antigen and also as a lymphoid system activator diat non-specificaUy 
enhances the inunune response (Hood et al.. Immunology. Second Ed. , 1984. 
Benjamin/Cummmgs: Menlo Parte. California, p. 384). Often, a primary 
challei^ge with an antigen alone, m the absence of an adjuvant, will fail to elicit a 
humoral or cellular immune response. Adjuvants inchide, but are not limited to, 

15 complete Freund's adjuvam, incomplete Freund's adjuvant, saponin, mmeral gels 
such as ahmunum hydroxide, surface active substances such as lysolecithin, 
pluronic polyols, polyanions, peptides, oU or hydrocarbon emulsions, keyhole 
limpet hemocyanins, dinitrophenol, and potentially useful human adjuvants such as 
BCG (bacille Calmette-Guerin) and CoryneiKiaerium pannm. Preferably, the 

20 adjuvam is pharmaceutically accepuible. 

Isolation of Genes for Glvensvltransferases 

The present invention provides the full length coding sequence of the LOS locus of 
Neisseria, and thus, aUows for obtaming any one or all five genes, termed herein 
Igt genes, encoding glycosyltransferases characteristic of diat tocus. Any Neisseria 
25 bacterial ceU can potentially serve as the nucleic acid source for die molecular 
clonii^ of an Igt gene. In a ^ific embodiment, iitfra. the genes are isolated 
from Neisseria gononiioeae. The DNA may be obtained by standard procedures 
known in the art from cloned DNA (e.g. , a DNA "library"), by chemical 
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synthesis, by cDNA cloning, or by the cloning of genomic DNA, or fragments 
diereof, purified from the desired cell (See, for example, Sambnx)k et al., 1989, 
supra; Glover, D.M. (ed.). 1985, DNA Clonii«: A Practical Approach, MRL 
Press, Ltd., Oxford, U.K. Vol. I, II). For example, a N. gonorrhoeae genomic 
S DNA can be digested widi a restriction endonuclease or endonucleases, e.g. , 
&ztt3A, into a phage vector digested with a restriction endonuclease or 
endonucleases, e.g.^ AomHI/EcoRI, for creation of a phage genomic library. 
Whatever the source, the gene should be molecularly cloned into a suitable vector 
for propagation of the gene. 

10 In the molecular cloning of the gene from genomic DNA, DNA fragments are 
generated, some of which will encode the desired gene. The DNA may be 
cleaved at specific sites using various restriction enzymes. Alternatively, one may 
use DNAse in the presence of manganese to fragment the DNA, or the DNA can 
be physically sheared, as for example, by sonication. The linear DNA fragments 

15 can then be separated according to size by standard techniques, including but not 
limited to, agarose and polyacrylamide gel electrophoresis and column 
chromatography. 

Once the DNA fragments are generated, identification of the specific DNA 
fragment containing the desired Igt gene may be accomplished in a number of 

20 ways. For example, the generated DNA fragments may be screened by nucleic 
acid hybridization to the labeled probe synthesized with a sequence as disclosed 
herein (Benton and Davis, 1977, Science 196:180; Grunstein and Hogness, 1975, 
Proc. Natl. Acad. Sci. U.S.A. 72:3961). Those DNA fragments witfi substantial 
homology to the probe will hybridize. The present invention provides specific 

25 examples of DNA fragments that can be used as hybridization probes for 
glycosyltransferases, e.g., SEQ ID NO:l. 

As described above, the presence of the gene may be detected by assays based on 
the physical, chemical, or immunological properties of its e}q>ressed product. For 
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exaiiq>le DNA cIoms that produce a protein that» e.^., has similar or identical 
electrophoretic migration, isoelectric focusing behavior, proteolytic digestion 
maps, proteolytic activity, or functional properties, in particular 
glycosyltransferase activity the ability of a Lgt protein to mediate transfer of a 
5 sugar to an acceptor molecule. Alternatively, the putative lgt gene can be 
mutated, and its role as a glycosyltransferase established by detecting a variation 
in the structure of tte oligosaccharide of LOS. 

Alternatives to isolating the lgt genomic DNA include, but are not limited to. 
chemically synthesizing the gene sequence itself from a known sequence that 
10 encodes an Lgt, e.g., as shown in SEQ ID NO:L In another embodhnent, DNA 
for an lgt gene can be isolated PGR usuig oligonucleotide primere designed from 
the nucleotide sequences disclosed herein. Other methods are possible and within 
the scope of the invention. 

The identified and isolated gene can ttien be inserted into an appropriate cloning 
15 vector. A large number of vector-host systems known in the art may be used. 
Possible vectors include, but are not limited to, plasmids or modified viruses, but 
the vector system must be compatible with the host cell used. In a specific aspect 
of the invention, the lgt coding sequence is inserted in an £. coli cloning vector. 
Other examples of vectors include, but are not limited to, bacteriophages such as 
20 lambda derivatives, or plasmids such as pBR322 derivatives or pUC plasmid 
derivatives, e.g., pGEX vectors, pmal-c, pFLAG, etc. The insertion into a 
cloning vector can, for example, be accomplished by ligating the DNA fragment 
into a cloning vector which has conq>lemeiitaiy cohesive termini. However, if the 
complementary restriction sites used to fragment the DNA are not present in the 
25 cloning vector, the ends of the DNA molecules may be enzymatically modified. 
Alternatively, any site desired may be produced by ligating imcleotide sequences 
(linkers) onto the DNA termini; these ligated linkers may comprise specific 
chemically synthesized oligonucleotides encoding restriction endonuclease 
recognition sequences. In specific embodiment, PGR primers containing such 
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linker sites can be used to amplify the DNA for cloning. Recombinant molecules 
can be introduced into host cells via transformation, transfection, infection, 
electroporation* etc., so that many copies of the gene sequence are generated. 

Transformation of host cells with recombinant DNA molecules that incorporate the 
5 isolated Igt gene or synthesized DNA sequence enables generation of multiple 
copies of the gene. Thus, the gene may be obtained in large quantities by growing 
transformants, isolating the recombinant DNA molecules fix>m the transfonnants 
and, when necessaiy, retrieving the inserted gene from the isolated recombinant 
DNA. 

10 The present invention also relates to vectors containing genes encoding tnmcated 
forms of the em^me (fragments) and derivatives of Lgt*s that have the same 
functional activity as an Lgt. The production and use of fragments and derivatives 
related to an Lgt are within the scope of the present invention. In a specific 
embodiment, the fragment or derivative is functionally active, /.e., capable of 

15 mediating transfer of a sugar to an acceptor molecule. 

Truncated fragments of the glycosyltransferases can be prepared by eliminating N- 
terminaK C-terminal, or internal regions of the protein that are not required for 
functional activity. Usually, such portions that are eliminated will include only a 
few, e.g., between 1 and 5, amino acid residues, but larger segments may be 
20 removed. 

Chimeric molecules, e.g., fusion proteins, containing all or a functionally active 
portion of a glycosyltransferase of the invention joined to another protein are also 
envisioned. A glycosyltransferase fusion protein comprises at least a functionally 
active portion of a non-glycosyltransferase protein joined via a peptide bond to at 
25 least a functionally active portion of a glycosyltransferase polypeptide. The non- 
glycosyltransferase sequences can be amino- or carboxy-terminal to die 
glycosyltransferase sequences. Expression of a fusion protein can result in an 
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enzymaticaUy inactive glycosyltransferase fusion protein. A recombinant DNA 
molecule encoding such a fusion protein comprises a sequence encoding at least a 
functionally active portion of a non-glycosyltransferase protein joined in-fhune to 
the glycosyltransferase coding sequence, and preferably encodes a cleavage site for 
5 a specific protease, e.g. , thrombin or Factor Xa, preferably at the 

glycosyltransferase-non-glycosyltransferase juncture. In a specific embodiment, 
tte fusion protein may be aq>ressed in Esdierichia oM. 

In particular, Lgt derivatives can be made by altering encoding nucleic acid 
sequences by substitutions, additions or deletions tiiat provide for functionally 
10 equivalent molecules. Due to the degoieracy of nucleotide codn^ sequences, 
other DNA sequences which oicode substantially die same amino acid sequence as 
an lgt grae may be used m die practice of die present invention. These include 
but are not limited to nucleotide sequences comprising all or portions of lgt genes 
fliat are altered by die substitution of different codons that encode die same amino 

15 acid residue widiin die sequence, dius producing a silent change. Likewise, die 
Lgt derivatives of the invention include, but are not limited to. tiiose containing, 
as a primary amino acid sequence, all or part of die amino acid sequence of an 
Lgt including altered sequences in which functionally equivalent amino acid 
residues arc substitoted for residues widiin die sequence resulting in a conservative 

20 amino acid substitution. For example, one or more amino acid residues widiin the 
sequence can be substituted by anodier amino acid of a similar polarity, which acts 
as a functional equivalem. resulting in a silent alteration. Substimtes for an amnio 
acid within the sequence may be selected from odier monbers of the class to 
which die amino acid belongs. For example, die noqxihr (hydn^hobic) ammo 

25 acids inchide alanine, leucine, isoleucine, valine, jmilme. phenylalanine. 

QTptophan and mediionine. The polar neutral amino acids include glycine, serine, 
direonine. cysteine, tyrosine, asparagine. and glutamine. The positively charged 
(basic) amino acids incluite arginine, lysine and histidine. The negatively charged 
(acidic) amino acids include aspartic acid and glutamic acid. 
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The genes encoding Lgt derivatives and analogs of the invention can be produced 
by various methods known in the art (e.g. , Sambrook et ah, 1989, supra). The 
sequence can be cleaved at ^ropriate sites with restriction endonuclease(s), 
followed by fuitiier enzymatic modification if desired, isolated, and ligated in 
5 vitro. In tte production of the gene encoding a derivative or analog of Lgt, care 
should be taken to ensure that the modified gene remains within the same 
translational reading frame as the lgt gene, uninterrupted by translational stop 
signals, in the gene region where the desired activity is encoded. 

Additionally, the lgt nucleic acid sequence can be mutated in vitro or in vivo, to 
10 create and/or destroy translation, initiation, and/or termination sequences, or to 
create variations in coding regions and/or form new restriction endonuclease sites 
or destroy preexisting ones, to facilitate further in vitro modification. Any 
technique for mutagenesis known in the an can be used, including but not limited 
to, in vitro site-directed mutagenesis (Hutchmson, C, et al., 1978, J. Biol. Chem. 
15 253:6551; ZoUer and Smith, 1984, DNA 3:479-488; Oliphant et a!., 1986, Gene 
44:177; Hutchinson et al,, 1986, Proc. Natl. Acad. Sci. U.S.A. 83:710), use of 
TAB® linkers (Pharmacia), etc. PCR techniques are preferred for site directed 
mutagenesis (see Higuchi, 1989, "Using PCR to Engineer DNA", in PCR 
Technology: Principles and Applications for DNA Amplification, H. Erlich, ed., 
20 Stockton Press, Ch^ter 6, pp. 61-70). It is notable in this regard that the IgtA, 
IgtB, and IgtC genes contain long poly-G stretches that are particularly suscqitible 
to phase variation mutation. 

Expression of a Glycosyltransfiprase 



The gene coding for an Lgt, or a functionally active fragment or other derivative 
25 thereof, can be inserted into an appropriate expression vector, i.e. , a vector which 
contains the necessary elements for the transcription and translation of the inserted 
protein-coding sequence. An expression vector also preferably includes a 
replication origin. The necessary transcriptional and translational signals can also 
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be supplied by the native Igt gene and/or its flanking regions. A variety of host- 
vector systems may be utilized to express the protein-coding sequence. 
Preferably, however, a bacterial expression system is used to provide for high 
level expression of the protein with a higher probability of the native 
S conformation. Potential host-vector systems uiclude but are not lunited to 
m a mm a li a n cell systems infected with virus (e.g. , vaccinia virus, adenovirus, 
etc.); insect ccU systems infected with vims (e.g., baculovirus); micnx>rganisms 
such as yeast containing yeast vectors, or bacteria transformed with bacteriophage, 
DNA, plasmid DNA, or cosmid DNA. The expression elements of vectors vary 
10 in their strengths and specificities. Dqiending on the host-vector system utilized, 
any one of a number of suitable transcription and translation elements may be 
used. 

Preferably, the periplasmic form of the Lgt (containing a signal sequeiK:e) is 
produced for export of the protein to the Escherichia coli periplasm or in an 
15 expression system based on Bacillus subtiUis. 

Any of the methods previously described for the insertion of DNA fragments into 
a vector may be used to construct expression vectors containing a chimeric gene 
consisting of appropriate transcrq>tional/translational control signals and the 
20 protein coding sequences. These methods may include in vitro recombinant DNA 
and synthetic techniques and in vivo recombinants (genetic recombination). 

Expression of nucleic acid sequence encoding an glycosyltransferase or peptide 
fragment may be regulated by a second xnicleic acid sequence so that the 
glycosyltransferase or peptide is expressed in a host transformed with the 
25 recombinant DNA molecule. For example, expression of an glycosyltransferase 
may be controlled by any promoter/enhancer element known in the art, but these 
regulatory elements must be functional in the host selected for expression. For 
expression in bacteria, bacterial promoters are required. Eukaryotic viral or 
eukaryotic promoters, including tissue specific promoters, are preferred when a 
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vector containing an Igt gene is injected directly into a subject for transient 
expression, resulting in heterologous protection against bacterial infection, as 
described in detail below. Promoters which naay be used to control Igt gene 
expression include, but are not limited to, the SV40 early promoter region 
5 (Benoist and Chambon, 1981. Nature 290:304-310), the promoter contained in the 
3' long teiminal repeat of Rous sarcoma virus (Yamamoto, et al., 1980, Cell 
22:787-797). die herpes thymidine kinase promoter (Wagnn- et al., 1981, Proc. 
Nad. Acad. Sd. U.S.A. 78:1441-1445). die regulatoiy sequences of the 
metaUodiionein gene (Brinster et al., 1982. Nature 296:39-42); prokaiyotic 
10 expression vectors such as die /^-lactamase promoter (VUla-Kamaroff, et al.. 1978, 
Proc. Nad. Acad. Sci. U.S.A. 75:3727-3731), or die toe promoter (DeBoer, et 
al., 1983, Proc. Nad. Acad. Sci. U.S.A. 80:21-25); see also "Useful proteins 
from recombinant bacteria" m Scientific American, 1980, 242:74-94; and die like 

Bqnession vectors containing Igt gene inserts can be identified by four general 

15 approaches: (a) PCR amplification of die desired plasmid DNA or specific mRNA. 
(b) nucleic acid hybridization, (c) presence or absence of "marker" gene functions, 
and (d) expression of inserted sequences. In die first approach, die nucleic acids 
can be amplified by PCR widi incorporation of radionucleotides or stained widi 
ediidium bromide to provide for detection of die amplified product. In die second 

20 approach, die presence of a foreign gene inserted in an expression vector can be 
detected by nucleic acid hybridization using probes comprising sequences that are 
homologous to an inserted Igt gene. In die diird ^roach. die recombinam 
vector/host system can be idemified and selected based iqxm the presence or 
absence of certam "marker" gene fiinctions {e.g. . /3-galactosidase activity. PhoA 

25 activity, diymidine kinase activity, resistance to antibiotics, transformation 

phenoQ^. occlusion body formation in baculovirus, etc.) caused by die insertion 
of foreign genes in die vector. If die Igf gene is inserted widiin dK marter gene 
sequence of die vector, recombinants containmg the Igt insert can be idemified by 
die absence of die marker gene function. In die fourth approach, recombinant 

30 ei^ression vectors can be identified by assaying for die activity of die Igt gene 



WO96/100M 



PCT/DS9Sn2317 



27 

product expressed by the recombinant. Such assays can be based, for example, on 
the physical or fiinctional properties of the Igt gene product in in vitro assay 
systems, e.g. , glycosyltransferase activity. Once a suitable host system and 
growfli conditions are established, recombinant e>qnession vectors can be 
5 propagated and prq>ared in quanti^. 

Biosvnthesis of 01iposacclifl^H^^»f 

The glycosyltiansferases of the present invoition can be used in the biosynthesis of 
oligosacdiarides. The glycosyltransferases of the invention are cq»ble of 
stereo^iftc conjugation of a specific activated saccharide unit to a specific 

10 acceptor molecule. Such activated saccharides geiKrally consist of uridine, 
guanosine, and cytidine dipho^hate derivatives of die saccharides, in which the 
nucleoside diphosphate serves as a leaving group. Thus, the activated saccharide 
may be a saccharide-UDP, a saccbaride-GDP, or a saccharide-CDP. In specific 
embodiments, the activated saccharide is UDP-GlcNAC, UDP-GalNAc, or UDP- 

15 Gal. 

The term "acceptor molecule" as used herein refers to the molecule to which the 
glycosyltransferase transfers an activated sugar. As is well known in the art. 
synthesis of carbohydrates proceeds by sequential coupling of sugar residues to a 
lipid, e.g., dolichol pho^hate. In eukaiycHic cells, which glycosylate proteins, the 

20 oligosaccharide or polysaccharide is transferred ftom the activated lq>id earner to 
the polypeptide on die lummal side of the endoplasmic reticulum. In prokaiyotes. 
the carbohydrate can be synthesized direcUy on a lipid A molecule. It is likely 
diat the glyco^Itransfeiases of die invemion may be sensitive to the core, portion 
of die growing carbohydrate and die lipid molecule. Thus, in a preferred aspect. 

25 die accqitor molecule, ot carrier, contains a lipid, preferably a polyisoprenoid 
alcohol iipid such as dolichol phosphate. Maximum ^nthedc efficiency may 
ensue from use of lipid A as the carrier. While the lipid A is not useful as a 
carrier for direa administration of the resulting oligosaccharide to a subject, e.g. . 
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as a vaccine preparation* it may be ai^ropriate for use with a labile linkage for 
subsequent cleavage (under mild conditions) and separation of the oligosaccharide 
from tbt lipid carrier. It should further be noted that the glycosyltransferases will 
only work efficiently to add a specific activated saccharide to a saccharide residue 
S on the acceptor molecule that corresponds to the natural acceptor molecule. For 
example, LgtE catalyzes transfer of Gal to Glc/?l-»4Hep. Thus, where a 
glycosyltransferase mediates attachment of GalNAc to Glc, the nature of the Glc 
residue (whether it is attached directly or indirectly to the carrier, for example) 
will affect the reaction efficiency. It is unlikely that efficient synthesis can occur 
10 in the absence of a carrier, or using otter than a lipid carrier. However, even 
inefficient synthesis may be desirable, and practice of the present invention is not 
limited to use of acceptor molecules contaiiung lipids, but extends to saccharides, 
polysaccharides, polypeptides, glycoproteins, and the lite. 

For the synthesis of an oligosaccharide, a glycosyltransferase is contacted with an 
IS appropriate activated saccharide and an appropriate acceptor molecule under 

conditions effective to transfer and covalently bond the saccharide to the acceptor 

molecule. Conditions of time, temperature, and pH appropriate and optimal for a 

panicular saccharide unit transfer can be determined through routine testing; 

generally, physiological conditions will be acceptable. Certain co-reagents may 
20 also be desirable; for example, it may be more effective to contact the 

glycosyltransferase with the aaivated saccharide and the acceptor molecule in the 

presence of a divalent cation. 

According to the invention, the glycosyltransferase enzymes can be covalently or 
nonHX>valently immobilized on a solid phase siqiport such as SEPHADEX, 
25 SEPHAROSE, or poly(acrylamide-co-N-acryloxysuccimiide) (PAN) resin. A 
specific reaction can be performed in an isokited reaction solution, with facile 
separation of tte solid phase emymc from the reaction products. Immobilization 
of the enzyme also allows for a continuous biosyntbetic stream, with the specific 
glycosyltransferases attached to a solid support, with the supports arranged 
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randomly or in distinct zones in die specified order in a column, with passage of 
the reaction solution through the cohimn and elution of the desired oligosaccharide 
at the end. An efficient method for attaching the glycosyltransferase to a solid 
support and using such immobilized glycosyltransferases is described in U.S. 
5 Patent No. 5,180,674, issued January 19, 1993 to Roth, which is specifically 
incorporated herein by reference in its entirety. 

An oligosaccharide, e.g.^ a disacdiaride, prepared usit^ a glycosyltransferase of 
the present invention can serve as an acceptor molecule for furdier synthesis, 
either using other glycosyltransferases of the invention, or glycosyltransferases 

10 known in the art (see, e.g.. Roth, U.S. Patent No. 5.180,674, and Roth, 

International Patent Publication No. WO 93/13198, published 8 July 1993, each of 
which is incorporated herein by reference in its entireQf). The oligosaccharide 
compositions of the invention are useful in a wide variety of therapeutic and 
diagnostic applications. For example, the saccharide compositions can be used as 

15 blocking agents for cell surface receptors in the treatment of numerous diseases 
involving cellular adhesion. Alternatively, saccharide compositions useful as 
nutritional supplements, antibacterials, anti-metastases agents, anti-inflammatory 
agents (e.g., for binding to inflanunatory-associated lectins or cell surface 
receptors), to mention but a few, are contemplated by the instant invention. As 

20 noted above, the glycosyltransferases of the invention can be used in conjunction 
with other glycosyltransferases known in the art or to be discovered to synthesize 
complex oligosaccharides or polysaccharides. 

Alternatively, the glycosyltransferases of the invention can be used to synthesize 
oligosaccharides representative of the oligosaccharides found on various strains of 
25 Neisseria. For example, by deleting open reading frames from the locus, or by 
selecting only a few of the glycosyltransferases of the invention for synthesis, 
alternative oligosacdiaride structures can be prepared. These can be used m 
vaccine preparations effective against Neisseria variants, in particular, subunit 
vaccines against gonococcus and meningococcus. 
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Alternatively, the glycosylttansferases of the present invention can be used to 
prepare oligosaccharides corresponding to oligosaccharides associated with human 
glycolipids. Thus, in specific embodiments, the present invention provides for 
synthesis of an oligosaccharide corresponding to lacto-N-neotetraose of the 
5 sphingolipid paragloboside; an oligosaccharide that mimics gangliosides; and a 
mimic of the saccharide portion of globoglycolipids, which is the structure 
characteristically found in Neisseria meningitidis inmiunotype LI. The 
oligosaccharides of the present invention correspond to the core oligosaccharides 
of the blood groiq> antigens, and therefore have great utility in the preparation of 
10 such blood group antigens for diagnostic or therapeutic purposes. 

Accordingly, a method for preparing an oligosaccharide having the strucmre 
GalNAcj81-^3Gal^l-wK}lcNAc/31-3Gal/31-^lc (i.e., ganglioside) comprises 
sequentially performing the steps of: 

a. contacting a reaction mixture comprising an activated Gal to an 
15 acceptor moiety comprising a Glc residue in the presence of a 

glycosyliransferase having an amino acid sequence of SEQ ID NO: 6, or a 
functionally active fragment thereof; 

b. contacting a reaction mixture comprising an activated GlcNAc to the 
acceptor moiety comprising a Gali31-*4Glc residue in the presence of a 

20 glycosyliransferase having an amino acid sequence of SEQ ID NO:3, or a 

functionally active fragment thereof; 

c. contacting a reaction mixture comprising an activated Gal to the 
acceptor moiety comprisiiig a GlcNAcj?l-^3Gal/?l-i4Glc residue in the 
presence of a glycosyltransferase having an amino acid of SEQ ID NO:8; 

25 and 

d. contacting a reaction mixture comprising an activated GalNAc to the 
acceptor moiety comprising a Galj31-*4GlcNAc/Sl-»3Gal/31-wtGlc residue in 
the presence of a glycosyltransferase having an amino acid sequence of 
SEQ ID NO:5, or a functionally active fragment thereof. 
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Similarly, a method for preparing an oligosaccharide having the structure 
Gal/81-i4GlcNAci91-^3GaliJl-^lc (Le. , lacto-N-neotetraose) comprises 
sequentially perfonning the steps of: 

a. contacting a reaction mixture comprising an activated Gal to an 
S acceptor moiety conq)rising a Glc residue in tfie presence of a 

glycosyltransferase having an amino acid sequence of SEQ ID NO: 6, or a 
functionally active fragment thereof; 

b. contacting a reaction mixture comprising an activated GlcNAc to the 
accqitor moiety comprisir^ a Gali91-i^lc residue in the presence of a 

10 glycosyltransferase having an ammo acid sequence of SEQ ID NO:3« or a 

functionally active fragment thereof; and 

c. contacting a reaction mixture comprising an activated Gal to the 
acceptor moiety comprismg a GlcNAc^l->^3Gal/91->4Glc residue in the 
presence of a glycosyltransferase having an amino acid of SEQ ID NO:8. 

IS In another embodiment, a method for preparing an oligosaccharide having the 
structure Galal-*4Gal/31-**Glc (i.e., globoglycolipids) comprises sequentially 
performing the steps of : 

a. contacting a reaction mixture comprising an activated Gal to an 
acceptor moiety comprising a Glc residue m the presence of a 

20 glycosyltransferase having an amino acid sequence of SEQ ID NO:6, or a 

functionally active fragment thereof; and 

b. contactii^ a reaction mixture comprising an activated Gal to the 
acceptor moiety comprising Gal^l-»4Glc in the preseixre of a 
glycosyltransferase having an amino acid sequence of SEQ ID NO:4, or a 

25 functionally active fragment thereof. 

Such oligosaccharides can be prepared using lipid A as a carrier. Preferably, if 
the resultmg glycolipid is to be used m a vaccine, a non-toxic lipid, such as 
dolichol phosphate, is used as the carrier. 
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Vaccination 

Active immunity against Neisseria strains can be induced by immunization 
(vaccination) witb an immunogenic amount of an oligosaccharide prepared 
according to the present invention in admixture with an adjuvant, wherein the 
5 oligosaccharide is the antigenic conqx)nent of the vaccine. Piefeiably, the 

oligosaccharide is conjugated to a carrier protein. Alternatively, where the antigen 
is a glycolipid, it can be incorporated in a liposome. 

The oligosaccharide alone cannot cause bacterial infection, although the 
oligosaccharide on lipid A is toxic, and the active inummity elicit by vaccination 
10 according to the present invention can result in immediate immune response. 

Selection of an adjuvant depends on the subject to be vaccinated. Preferably, a 
phannaceutically acceptable adjuvant is used. For example, a vaccine for a human 
should avoid oil or hydrocarbon emulsion adjuvants, including complete and 
incomplete Freund's adjuvant. One example of an adjuvant suitable for use with 
15 humans is alum (alumina gel), A vaccine for an animal, however, may contain 
adjuvants not appropriate for use with humans. 

A vaccine of the invention, i.e. , a vaccine comprising an oligosaccharide 
corresponding to an antigenic determinant on a strain of Neisseria, can be 
administered via any parenteral route, including but not limited to intramuscular, 
20 intraperitoneal, intravenous, and the like. 

Administration of an amount of a Neisseria oligosaccharide sufficient to inhibit 
adhesion of the bacterium to its target cell may also be effective for treating 
meningococcal or gonococcal infection. The required amount can be determined 
by one of ordinary skill using standard techniques. 
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Expression of GlycosY^trynsferas es in for Intracellular Glycosvlation 

The present invention further contemplates transforming a host cell with a 
glycosyltransferase or glycosyltransferases of the invention. It is expected that 
e;q)ression of the glycosyltransferase. possibly in a cell lacking one or more 
5 endogenous glycosyltransferases, may result in novel glycosylation of lq>ids and 
proteins in such eukaryotic cells, and novel glycosylation of lipids in procaryotic 
cells. 

For exanqile, tranrformation of a bacterium with non-toxic lipid molecules may 
provide for expression of Neisseria oligosaccharides on such a bacterium, which 
10 can xhtn be used dunecdy in a whole cell vaccine. 

Alternatively, expression of such a glycosyi transferase in yeast, insect, or 
mammalian cell lines may result in novel glycosylation of lipids and proteins 
expressed by these cells. 

Antibodies to Neisseria Olig osaccharides, and Diagnosis and Theranv Therewith 

15 Just as the oligosaccharides can be used in vaccines, so to they can be used to 
generate antibodies to themselves, which antibodies, in turn, can be used to detect 
that particular strain of bacteria or for passive immunity. Antibodies include but 
are not limited to polyclonal, monoclonal, chimeric, smgle chain. Fab fragments, 
and an Fab expression library. Various procedures known in the art may be used 

20 for the production of polyclonal antibodies to oligosaccharide. For the production 
of antibody, various host animals can be immunized by injection with the 
oligosaccharide, including but not limited to rabbits, mice, rats, steep, goats, etc. 
In one embodiment, the oligosaccharide can be conjugated to an immunogenic 
carrier, e.g., bovme serum albumin (BSA) or keyhole limpet hemocyanin (KIM). 

25 Various adjuvants may be used to mcrease the immunological response, depending 
on the host species. For preparation of monoclonal antibodies directed towaid the 
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oligosaccharide, or fragment, analog, or derivative thereof, any technique that 
provides for the production of antibody molecules by continuous cell lines in 
culnire may be used. These include but are not limited to the hybridoma 
technique originally developed by Kohler and Milstein (1975. Nature 256:495- 
5 497), as well as the trioma technique, the human B-cell hybridoma technique 
(Kod)or et al., 1983» bnmunolpgy Today 4:72), and the EBV-hybridoma 
technique to produce human monoclonal antflxxlies (Cole et al.« 1985, in 
Monoclonal Antibodies and Cancer Therapy, Alan R. liss. Inc., pp. 77-96). In an 
additional embodiment of the invention, monoclonal antibodies can be pnxluced in 

10 germ-free animals utilizing recent technology (PCT/US90/02545). Accordmg to 
die invention, human antibodies may be used and can be obtained by using human 
hybridomas (Cote et al., 1983, Proc. Natl. Acad. Sci. U.S.A. 80:2026-2030) or 
by transformmg human B cells with EBV virus in vitro (Cole et al., 1985, in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, pp. 77-96). In fact, 

15 according to the invention, techniques developed for the production of "chimeric 
antibodies" (Morrison et al., 1984, J. Bacteriol. 159-870; Neuberger et al., 1984, 
Nature 312:604-608; Takeda et al., 1985, Nature 314:452-454) by splicing the 
genes from a mouse antibody molecule specific for an oligosaccharide together 
with genes from a human antibody molecule of appropriate biological activity can 

20 be used; such antibodies are within the scope of this invention. Such human or 
humanized chimeric antibodies are preferred for use in therapy of human diseases 
or disorders, since the human or humanized antibodies are much less likely than 
xenogenic antibodies to induce an imnuine response, m particular an allergic 
response, themselves. Accordmg to the invention, techniques described for the 

25 production of single chain antibodies (U.S. Patent 4,946,778) can be adapted to 
produce oligosacdiaride-specific sii^le chain antibodies. An additional 
embodiment of the invention utilizes die techniques described for die construction 
. of Fab expression libraries (Huse et al., 1989, Science 246:1275-1281) to allow 
TzpiA and easy identification of monoclonal Fab fragments with the desired 

30 q)ecificity for an oligosaccharide, or its derivatives, or analogs. 
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Antibody fragments which contain the idiotype of the antibody molecule can be 
generated by known techniques. For example, such fragments include but are not 
limited to: the F(ab')2 fragment which can be produced by pepsin digestion of the 
anUbody molecule; the Fab' fragments which can be generated by reducing the 
5 disulfide bridges of the F(ab'h fngment, and the Fab fragments which can be 
gomated by tieating die antibody molecule with p^am and a reducing agent. 



In the production of antibodies, screening for the desired antibody can be 
accomplished by techniques known in the art, eg., ladioimmunoassay. ELISA 
(oizyme-linked immunosoibant assay), 'sandwich" hnmunoassays, 
10 immunoiadiometric assays, gel diffusion precipitin reactions, iimnunodifiusion 
assays, in situ immunoassays (using colloidal gold, enzyme or radioisotope labels, 
for exanqile), western blots, precq)itation reactions, agghitination assays (e.g. . gel 
agghitination assays, hemagglutination assays), complement fixation assays, 
immunofluorescence assays, protein A assays, and nnmunoelectrophoresis assays, 

15 etc. In one embodiment, antibo<ty bmding is detected by detecting a label on die 
primary antibody. In anotiier embodiment, the primary antibody is detected by 
detecting binding of a secondary antibody or reagent to the primary antibody. In a 
further embodiment, the secondary antibody is labeled. Many means are known in 
die art for detecting binding in an immunoassay and are widiin the scope of the 

20 presem invention. For example, to select antibodies which recognize a specific 
oligosaccharide, one may assay generated hybridomas for a product which binds to 
an oligosaccharide coniainmg such epitope. For selection of an antibody specific 
to an oligosaccharide fhrni a particular species or strain of Neisseria, one can 
select on die basts of positive bindii^ with oligosaccharide expressed by or 

25 isolated from cells of diat species or strain. 



The foregoing antibodies can be used in methods known in die art relating to die 
localization and activity of the oligosaccharide, e.g., for Western blotting, imaging 
oligosaccharide in situ, measuriqg levels thereof m appropriate physiological 
samples, etc. 
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Diagnosis of infection with a Gram positive bacterium can use any inununoassay 
fonnat known in the art, as desired. The antibodies can be labeled for detection in 
vitro^ e.g., with labels such as enzymes, fluoropbores, chromophores, 
radioisotopes, dyes, colloidal gold, latex particles, and chemiluminescent agents. 
5 Alternatively, the antibodies can be labeled for detection in vivo, e.g. , with 
radioisotq[ies (preferably technetium or iodine); magnetic resonance shift reagents 
(such as gadolinram and manganese); or radio-opaque reagents. 

Alternatively, the nucleic acids and sequences thereof of the invention can be used 
in the diagnosis of infection with Neisseria, in particular, to identify a particular 

10 stram, or to determine which, if any, of the glycosyltransferase genes are mutated. 
For example, the Igt genes or hybridizable fragments thereof can be used for in 
situ hybridization with a sample from a subject suspected of harboring an mfection 
of Neisseria bacteria. In another embodiment, specific gene segments of a 
Neisseria can be identified using PCR amplification with probes based on the Igt 

15 genes of the invention* In one aspect of the invention, the hybridization with a 
probe or with the PCR primers can be performed under stringent conditions, or 
with a sequence specific for a unique strain or a limited number of strains of the 
bacterium, or both, thus allowing for diagnosis of infection with that particular 
strain (or strains). Alternatively, the hybridization can be under less stringent 

20 conditions, or the sequence may be homologous in any or all strains of a 
bacterium, thus allowing for diagnosis of infection with that species. 

The present invention will be b^r understood from a review of the following 
illustrative description presenting the details of the constmcts and procedures that 
were followed in its development and validation. 

25 EXAMPLE 



This Example describes a locus in Neisseria gonorrhoeae strain F62 contaming 
five genes. Four of the genes are responsible for the sequential addition of the 
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GalNAc^l-3Gal^l-^GlcNAc/Sl-*-3GaIi51-*4 to the substrate GIcj81-*4Hep-^R of 
the inner core region (Yamasaki et al.. 1991, Biochemistry 30:10566). The fifth 
gene is involved with the addition of the a-linked galactose residue in the 
biosynthesis of the alternative LOS structure Galal-«4GaliSl-*4GlciSl-^Hep-^R 
5 (John et al., 1991, J. Biol. Chem. 266:19303). The DNA sequence analysis 
revealed that the first* third and fourth reading frames contained poly-<j tracts 
which in strain F62 were respectively 17. 10 and 11 bp. Thus, ttiree of die LOS 
biosynthedc enzymes are potentially susceptible to premature termination by 
reading-frame changes, as has been reported for the gonococcal pilC genes 
10 (Jonsson et al.. 1991. EMBO J. 10:477; Rudel et al.. 1992. Molec. Microbiol. 
6:3439). It is likely that these structural feamres are responsible for the high- 
frequency genetic variation of gonococcal LOS (Schneider et al.. 1988. Infect. 
Immun. 56:942). 

Materials and Methods 

15 Reagents and chemicals. Most laboratory chemicals were obtained ftx)m Signia 
Chemical Co (St. Louis, MO). Restriction en^mes were purchased from New 
England Biolabs (Beverly. MA). 

Media and growth conditions. E. coli strains were grown in solid or liquid LB 
medhun (Sambrook et al.. 1989. Cold Spring Harbor Laboratory Press. Cold 

20 Spring Harbor); antibiotics were added as applicable. Carbenicillin was used at 50 
fAg/ml and erythromycm at 200 /ig/ml. Neisseria gonorrhoeae strain F62 was 
grown on GC agar (Swanson, 1978. Infea. Immun. 19:320) or GC agar 
containing 2 /ig/ml erythromycin. For isolation of LOS or genomic DNA. 
gonococci were grown in 1.5% proteose peptone broth (Difco Laboratories, 

25 Detroit MI). 30 mM phosphate. 8.5 mM NaCl supplemented with 1 % isovitalex 
(Becton Dickinson Microbiology Systems. Cockeysville. MD). 
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Recombinant DNA methods. Plasmids were purified using either Qiagen columns 
or the QIAprep spin columns obtained from Qiagen Inc. (Chatsworth, CA). 
Digestion with restriction enzymes, gel electrophoresis, ligations with T4 DNA 
polymerase and transformation of E. coU were done according to Sambrook et al. 
5 (Sambrook et al. , 1989, Cold Spring Harbor Laboiatoiy Press. Cold Spring 
Harbor). Southern hybridization was performed on Hybond N+ membranes 
Amersham Co. (Arlington Heights. IL) with DNA labeled usmg the ECL kit from 
Amersham Co. Genomic DNA was isolated as described by Moxon et al. (Moxon 
et al.. 1984, J. Oin. Invest. 73:298). 

10 A gene bank of Neisseria gonorrhoeae strain F62 genomic DNA was constructed 
by ligatiiig ca 20 kb fragments obtained by mcomplete digestion with SaUiA into 
BanOWEcom digested X2001 (Kam et al.. 1984. Gene 32:217). The phage 
library was screened by hybridization with random-primer-labeled plasmid 
pRIOPI. and 5 clones were isolated by plaque purificaUon. The phage from these 

15 clones were purified by sedimentation followed by flotation on CsCl (Davis et al.. 
1980, Cold Spring Harbor Laboratory. Cold Spring Harbor, NY), and the DNA 
was isolated. From one of diese clones, two Clal fragments of 4.9 and 3.4 kb 
were isolated by gel electrophoresis and recovery with Geneclean II (BIO 101 
Inc., La Jolla. CA). These were ligated into Ctel cut pBluescript H SK- from 

20 Stratagene (La JoUa. CA) and called p4900 and p3400 respectively. p4900 
contained a Pstl site in the insert and was subdivided into two clones containing 
inserts of 2.1 and 2.8 kb. The clone containing the 2.8 kb insen was caUed 
pPstCla. The insorts m p3400 and pPstCb were sequenced by the chain 
termination mediod (Sanger et al.. 1977, Proc. Natl. Acad Sci. USA 74:5463) 

25 using Sequenase H, (United States Btochenucal Co., Cleveland. OH). AU of the 
sequeijce presented in Figure 2 was completed in both directions. 

The msertion and deletions shown in Figure 6 were constructed as follows. II. 13. 
Al and A2 used plasmid pPstCIa cut respectively with BsaBl, Ascl, Styl and 
double cut with Slryl and BsaBl. 12 and A3 used plasmid p3400 cut with Agel or 
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StyL The complete locus was assembled by cloning the Clal - Apal fragment from 
p3400 into pPstCla cut with Clal and Apal, and the plasmid was called pLOS5. 
Deletions A4 and A5 were constructed using pLOS5 and digestion with Styl and 
Bbsl or with Styl alone. In all instances (except digestion with AsaBI) the cut 
5 plasmids were treated with the Klenow fragment of E. coU DNA polymerase to 
blunt the ends, and ermC (erythromycm resistance marker) was inserted. The 
ermC gene was isolated from plasmid pIM13 (Projan et al.. 1987. J. BacterioL 
169:5131) as a CZoI - Hindm fragment aid cloned into the same sites in plasmid 
pHSS6 (Scifcrt et al., 1986. Ptoc. Natl. Acad. Sci. USA 83:735). From this 
10 plasmid k was excised as a Notl fragment, the ends blunted by treatment with 
Klenow fragment of DNA polymerase, purified by gel electrophoresis and 
recovery with Geneclean U. 

Transformation of piliated Neisseria gonorrhoeae strain F62 was performed with 
plasmids isolated from E. coli (Klugman et al.. 1989, Infect. Immun. 57:2066) 

15 and the transformants selected on GC agar (Swanson, 1978, Infect, hnmun. 

19:320) contaimng 2 /xg/ml erythromycin. The fidelity of the genomic alteration 
of each of the gonococcal transformants was verified by sequencing the upstream 
and downstream junctions of the ermC gene in their genomic DNA using a PCR 
technique. Two 5' biotinylated primers, GCCGAGAAAACTATTGGTGGA 

20 (SEQ. ID, N0:9) and AAAACATGCAGGAATTGACGAT) (SEQ. ID. NO: 10), 
were synthesized; these were based on the ermC sequence near its upstream and 
its downstream end respectively. The primers were designed such that their 3' 
ends pointed outward from the ermC gene. Each of these primers was used 
together witfi a suitable primer matchmg the sequence of the LOS locus near the 

25 putative insertion. PCR was performed according the instructions supplied with 
the GeneAnq) PCR Reagent Kh from Perldn Ehner (Branchburg, NJ) using 25 
cycles. In all instances the expected size product was obtained. The DNA 
sequence of diese products was determuied by purifying the PCR product on 
magnetic streptavidin beads from Dynal, Inc. (Lake Success, NY) and sequencing 

30 with the Sequenase n kit according to a protocol provided by Dynal, Inc., based 



wo 96^0086 



PCTAJS9S/12317 . 



40 

on the method developed by Hultman et al (Hultman et al., 1989, Nucleic Acids 
Res. 17:4937). The sequences were analyzed by computer programs in the GCG 
package of Genetics Conqniter Group. Inc. (Madison, WI). 

Immunological methods. Monoclonal antibodies 17-1-Ll (LI), 9-2-L378 (L3), 2- 
5 1-L8 (L8) were obtained as filtered ascites fluids. Antibody 1-1-M was obtained 
as ascites fluid and 3F11 and 4C4 were obtained as tissue culture supeniatants. 
LOS was extracted from each of the ^nococcal mutants by die hot phenol-water 
method (Wes^bal and Jann, 1965. Academic Press, New Yoik 83-91) and 
purified as described (Johnston et al., 1976. J. Exp. Med. 143:741). The LOS 

10 was dihited to 200 ^ml in the Western blot buffer described by Towbin et al. 
(Towbin et al., 1979. Proc. NaU. Acad. Sci. USA 76:4350), and 1.5 ^1 aliquots 
were spotted on Immobilon-P membrane from MUlipore Coip (Bedford, MA) that 
was lying on 3MM Whatman filter paper (Whatman Ltd.. Maidstone. Rigland) 
soalced in the blotting buffo-. The spots were allowed to absorb into the 

15 membrane over a period of 2 min and the strips were placed in blocking buffer for 
at least 60 min. The blocking buffer consisted of 3% gelatin dissolved in 150 mM 
NaCl, 10 mM Tris-HCI 10 mM pH 7.5, 5 mM MgClz, 0.02% NaNj. The strips 
were washed thrice in the same buffer containing 1 % gelatin. The strips were 
treated for 2 h with monoclonal antibodies diluted in blocking buffer. The 

20 antibodies available as ascites fluids were diluted 1/1000, antibodies available as 
tissue culture supematants 1/10. The strips were washed, incubated for 60 min 
with a 1/1000 dilution of phosphatase-conjugated anti-IgG,IgA.IgM firom Cappel 
(Oiganon Teknika Co.. West Chester, PA), washed and stained as described 
previously (Blake et al.. 1984, Analyt. Biochem. 136:175). 

25 Get elean^horesis. Gel electrt^horesis of LOS samples was performed as 

described by Lesse et al (Less et al.. 1990, J. Immunol. Meth. 126:109) and the 
gels silver stained (Hitchcodc and Brown. 1983. J. Bactniol. 154-269). 
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Results 



Cloning of the LOS Locus, During attempts to isolate the porin gene of Neisseria 
gonorrhoeae, pBR322 clones containing a 4.9 kb Qal fragment that reacted by 
colony blots with a rabbit antisenmi to purified porin were repeatedly isolated. 
5 An immunoreactive subclone, pRlOPI. consisting of a 1305 bp Rsal-dal fragment 
was derived and its DNA sequence was deteimined. This sequence had homology 
to a gene isolated fixnn HaanaphOus irfiuenme called lex-l (Cope et al., 1991, 
Molec. Microbiol. 5:1113) or Uc2A (High et al.. 1993, Molec. Microbiol. 9:1275) 
that is known to be involved in LPS synthesis of that species. Using subclone 
10 pRlOPI as a probe. Soudiem blots of Nasseria gonorrhoeae genomic DNA 
digested with CUA revealed hybridization widi two fragments, 4.9 and 3.4 kb. 
However, digestion with some other restriction enzymes gave rise to only a single 
band. Notably, digestion with ^a\ gave rise to a single band of 4.1 kb. 
suggesting that die two copies were closely linked (data not shown). 

15 A X2001 bank of Neisseria gonorrhoeae strain F62 DNA was screened by 

hybridization with pRlOPI and 5 clones were isolated. One of these clones, when 
digested with either Ctel or Bfal and examined by SouAem hybridization using 
pRlOPI as die probe, gave rise to a pattern identical to diat seen with genomic 
DNA. The appropriate Oal fragments of diis X2001 clone were isolated and 
cloned imo UK CUA site of pBluescript O SK-. The entire sequence of the 3400 
C/fll fragment was determined. Mapping of die clone containing die 4900 bp Clal 
fragment indicated diat fliere was a single Pstl site in die clone about 2.8 kb from 
one side, allowing die clone to be divided into two subclones. Partial sequence of 
die ends of die 2.1 kb subclone indk:ated diat it contamed a coding frame 
25 homologous to die E. coU COOH-terminal portion of die a subunit of 

glycyl-tRNA syndietase (g(yS) and die majority of die /S subunit of tiiis gene 
(Webster et al., 1983. J. Biol. Chem. 258:10637). The predicted lengdi of DNA 
needed to match die E. cott sequence was present; diis clone was not examined 
fiuther. 



20 



PCT/DS95/12317 



42 

DNA Sequence of the LOS Locus. A summary of the features foimd by sequencing 
the two clones is illustrated in Figure 2. Following the gfyS gene were found five 
closely spaced open reading frames. The last frame has 46 bp downstream of the 
termination codon a sequence typical of a rho independent termination signal. 
S Subsequently, there is an area of ca 100 bp that has striking homology to the 
IS1106 neisseria! insertion sequence (Knight et al., 1992, Molec. Microbiol. 
6:1565). Further elucidation of the nature of this locus, presented below, showed 
the five open reading frames code for JLX)S glycosyl transferases and hence they 
have been named IgtA - IgtE. 

10 Searches for internal homology within this locus indicates that the DNA coding 
for the first two genes (IgtA, IgtB) is repeated as the fourth and fifth genes {IgtD, 
IgiE) and that interposed is an additional open reading frame, IgtC. This is in 
keeping with the data (4)tained by Southern hybridization presented above, in 
which pRlOPI probe containing the IgtB and a small portion of the IgtC gene 

15 hybridized with two Clal fragments, but with only one Efal fragment (see 

positions of the Bfal sites in the LOS locus in Figure 2). In more detail, 16 bp 
following the stop codon of the tRNA synthetase (glyS) is the beginning of a stem 
loop structure followed closely by a consensus ribosome binding site (rbs), and 
within 6 bp is a TTG believed to be the initiation codon of IgtA. 2871 bp 

20 downstream from the beginning of the stem loop (closely following the stop codon 
of igtQ there is an ahnost perfect rqpeat of the stem loop structure, the rbs, and 
the TTG initiation codon of IgtD^ with the downstream sequence strongly 
homologous for about 500 bp. The sequences then diverge to some extent. 
However, at the beginning of IgtB and IgtE the homology again becomes nearly 

25 perfea for ca 200 bases to then diverge toward the latter part of the orfs. The 
simikuity of the homologous proteins is illustrated in Figures 3 and 4. These 
comparisons, demonstrate the near-perfect conservation of the primary structure in 
the N-terminal portions of the molecules with increasing divergence toward the 
COOH-termini of the proteins. 
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The tgtC sequence mterposed between the repeated portions of the locus is not 
repeated within the locus or in the Neisseria gonorrhoeae genome (data not 
shown). It appears to be homologous to £. coli rfal or rfaj genes, which are very 
closely related genes that serve as ghicosyl transferases in core LPS biosynthesis 
5 (Pradel et al., 1992, J. Bacteriol. 174:4736). The simihirity of ffal with IgiC is 
illustrated in Figure 5. 

It was found that three of these genes contained within their coding frame runs of 
guanosines coding for stretches of glycines (see Figure 2). These poly-G regions 
were found in IgU (17 bp), /grC(10 bp) and IgtD (11 bp); in each case the 
10 number G residues was one that maintained an intact reading frame (see Figures 3 
and 5). In each of the three genes a change of 1 or 2 G bases would cause 
prraiature termination of the transcript. 

LOS phenotype of Neisseria gonorrhoea F62 with deletions of the LOS locus. In 
order to define the function of the Igt genes, insertions or deletions of the LOS 

15 locus were constructed in plasmids propagated in £. coli. The insertions or 
deletions in each case were marked with the ermC gene, which is an excellent 
selective marker in Neisseria gonorrhoeae (Klugman et al., 1989, Infect. Immun. 
57:2066). The constructions arc summarized in Figure 6. II, 12 and 13 refer to 
insertions of the ermC marker into, respectively, a BsaBI, Agel and AscI site. 

20 Similarly, the deletions were constructed by excismg portions of the plasmids and 
substituting the erythromycin marker. The open arrows indicate the gene or genes 
disrupted. Each of these plasmids was used to transform Neisseria gonorrhoeae 
strain F62 and transformants were selected on erydiromycin-containing plates. 
The fideliQr of the genomic alteration of a prototype of each of the gonococcal 

25 transformants was verified by sequencing the upstream and downstream junction of 
the ermC gene. To sinq[>lify flie nomenclature in this report the gonococcal 
mutants have been given the same names used to identify the plasmid constructs in 
Figure 6. 
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The LOS of the mutants were examined by SDS-PAGE and compared to the LOS 
of strain 1291e. This strain was originally isolated by Dudas and Apicella (Dudas 
and Apicella, 1988. Infect. Immun. 56:499) as a pyocm-rcsistant mutant of strain 
1291 wild type and has been extensively characterized bodi chemically and 
5 genetically. Chemical analysis has shown that this mutant lacks completely die 
lacto-N-neotetraose substitution on heptose 1 (John et al.. 1991, J. Biol. Chem. 
266:19303). The graetic basis of this mutant has been defined (Zhou ct al., 1994. 
J. Biol. Chem. 269:11162; Sandlin and Stein, 1994, J. Bacteriol. 176:2930); it is 
a mutation of the pffn gene coding for phosphpghicomutase. This mutation 

10 prohibits the synthesis of UDP-glucose and hence the addition of ghicose to the 
heptose. As seen in Figure 7, the parental wild type F62 strain gives rise to two 
major LOS bands; their appearance is indistinguishable from SDS-PAGE patterns 
previously published by other worirers (Schneider et al., 1985, Amer. Soc. 
Microbiology, Washington 400-40S). The mutants are arranged on the gel 

15 according to the size of the major band that they contain. The size decreases from 
the top band of the F62 wt LOS in four clear steps to the size of the LOS of A4 or 
12. Since the 12 mutant (with an insertion into IgtE, the last gene in the locus) has 
the same phenotype as A4 (which has a complete deletion of the locus), it suggests 
that the IgtE product performs the first biosynthetic step. Thus, the enzymes 

20 encoded by IgtA D, although intact, do not have a substrate to act upon. Mutant 
A5 (a deletion of die locus with the excqjtion of IgtE) gives rise to a LOS that is 
one step larger, supporting the idea that this gene accounts for the initial 
biosynthetic step. Note that the LOS of both 12 and A4 mutants is perceptibly 
larger than the LOS of strain 1291e which is known to be unable to add glucose, 

25 the first residue in the lacto-N-neotetraose chain. These data suggest that IgtE 
encodes the galactosyl transferase enzyme which adds the first galactose of the 
lacto-N-neotetraose. 

The LOS preparations were also studied usipg a dot blot technique for their 
reactivi^ with monoclonal antibodies. The monoclonal antibodies employed and 
30 their reported specificities are shown in Figure 1. The reactions observed with the 
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LOS obtained from the parental strain and the mutants are summarized in Figure 
8. The reactivity of the parental F62 with 1-1-M, 3F11 and L8 was as reported 
previously by Mandrell et al (Mandrell et al., 1985. Amer. Soc. Microbiology, 
Washington 379-384) and by Yamasaki et al (Yamasaki et al.. 1991. Mol. 
5 Immunol. 28: 1233). Mutants A4 and 12 fiul to react with any of the antibodies. 
However. A5 gives a stroi^ reaction with antibodies 4C4 and L8. indicating that 
the first galactose residue is presoit. This is in keqiing with the SDS-PAGE 
results (see fig 6) and siq)ports die role of IgfE as the galactosyl transferase. It 
also indicates that deletions iq>stream of IgtE do not significantly inactivate its 
10 fimction by polar effects. The LOS of F62 wt parent has strong reactivity with L3 
and weak reactivity with 3F11. It is known that reactivity 3F1 1 is occluded by the 
addition of the GalNAc residue (Schneider et al., J. Exp. Med. 174:1601); this is 
not the case with die L3 antibody. The wt LOS reacts wifli 1-1-M, the antibody 
reactive when the terminal GalNAc residue is present. The reactivity wifli 1-1-M 
15 is lost in A3 which has a deletion only in IgtD. This suggest diat diis gene 
encodes the GalNAc transferase. 

The reactivity with antibody LI (specific for the alternative LOS structure capped 
with an al-^KJal) is not seen in wt LOS. is absent in II. and all deletions which 
affect IgtC. The reactivity is strongest in Al, whrch has a deletion of IgtA only. 

20 Note that diis mutant also has lost reactivi^ wiUi 3F1 1 and L3. These two 

findings suggest diat IgtA codes for the GlcNAc transfterase. and when tfiis residue 
is not added, the incomplete chain is a substrate for the action of IgtC to produce 
die alternative LOS structure. Note diat die sizes of die LOS products seen in 
Figure 7 are m accord widi the immunological data. This conclusion suggests dut 

25 IgtC encodes the a-Gal transferase. This is finrther suppcurted by the weak 

reactiviQT of mutant A3 widi antibody LI . Mutant A3 has a deletion of IgtD and 
foils tt> add die terminal GalNAc. allowing die a-Gal transferase to modify die 
lacto-N-neotetraose groi^ to produce a Pj-Iike globoside (Mandrell, 1992, Infect. 
Immun. 60:3017). Mutant 13 (with inactive IgtB) has lost reactivity widi 1-1-M, 

30 3F11 and LI, and remains only weakly reactive witii L3. Togedier widi die size 
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of the product, these observations suggest that IgtB encodes the galactosyl 
transferase adding Gali91-*4 to the GlcNAc residue. Ricinus lectin RCA-I is 
specific for terminal galactose in /3 linkage (Nicolson and Blaustein, 1972, 
Biochim. Biophys. Acta 266:543; Lin and Li, 1980. Eur. J. Biochem, 105:453) 
5 and was used to confirm the presence of this structure on the LOS preparations. 
Using EUSA tests it was found thai wild type, A3, A2 and A5 LOS, expected to 
bear a terminal /JGal, bound the lectin (see Figure 7), while A4, 12, Al and 13 
were umeactive (data not shown). 

Discussion 

10 A locus containii^ 5 open reading frames has been cloned. The effect of eight 
defied mutations within this locus on the size and serological reactivity of the 
LOS produced by gonococcal transfonnants suggests that these genes are the 
glycosyl transferases responsible for the biosynthesis of most of the lacto-N- 
neotetraose chain. The data obtained allow an identification of the function of 

15 each of these genes. It is noteworthy that IgtB and IgtE, which are stnicmrally 
very closely related, also perform an apparently very similar biosynthetic task, i.e. 
the addition of Gal01-*4 to GIcNAc or Glc, respectively. Similarly, the closely 
related IgiA and IgtD add GalNAc or GlcNAc /81-*3, respectively, to a Gal 
residue. IgtQ which is unrelated to the other genes in the locus, is responsible for 

20 the addition of a Galal-»4. 

The DNA sequence showed that three of the genes (IgtA, IgtC and IgtD) contain 
tracts of guanosines which code for glycine residues in the proteins. These 
provide a potential mechanism for high-frequency variation of expression of these 
genes. Slippage in such poly-G tracts is well documented to control the expression 
25 of the gonococcal pilC genes, with resultant effects on pilus adhesiveness to human 
epithelial ceUs (Rudel et al., 1992, Molec. Microbiol. 6:3439). In strain F62, the 
numbers of bases in each of the three poly-G regions were such that the proteins 
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are in frame, and this is in keq)ing with the ability of F62 wild type to produce a 
complete LOS including the addition of the terminal GalNAc. 

Three aspects of LOS biosynthesis appear potentially to be subject to high 
frequency variation. The first is the addition of the terminal GalNAc (/^tf)). This 
5 would cause an alteration of reactivity with monoclonal antibody 1-1-M, and this 
phase variation has been r^Kuted by van Putten (Van Putten, 1993, EMBO J. 
12:4043). Similarly, a change m IgtA would cause the failure of the addition of 
GIcNAc to the growing chain and truncate die LOS at the /9-lactosyl level. This is 
a very common form of LOS in gonococci with a 3.6 kilodalton molecule, which 

10 confers resistance to tfie bactericidal effect of normal human serum (Schneider et 
al., 1985, Infect. Immun. 50:672). It is tempting to speculate that the in vitro 
variation between variant A and C of MSI 1^ from the jj-lactosyl chain to a 
conq>lete LOS (which bad a selective advantage in vivo in the volunteers) could be 
explained by regaining functional expression, of the GlcNAc transferase IgtA. 

15 Finally, the variable addition of al-*4Gal to either the 0-lactosyI (p'^-Iike globo- 
triose) or the lacto-N-neotetraose group (Pj-like globoside) (Mandrell, 1992, Infect. 
Immun. 60:3017) would be under the control of the expression of IgtC. The 
activity of the IgtC transfer^ appears to compete pooriy with the other 
transferases for precursor and its activity is evident only if either IgtA or IgtD are 

20 silent. For the Galal-*4GaI^l-*4Glc irisaccharide to be synthesized the GIcNAc 
transferase IgtA must be inactive and for expression of the Pplike globoside 
Galal-w|Gal/31^GlcNAc^l-i-3GaIi91-^lc the GalNAc transferase IgtD must be 
silent. 

Comparable high frequency antigenic variation of Haemophilus influenzae LOS has 
25 also been noted and has been attributed to changes in translational frame caused by 
shifts in the number of CAAT repeats in two separate loci, lid (Weiser et aL, 
1989, &11 59:657) and Uc2 (High et al., 1993, Molec. Microbiol. 9:1275). Shifts 
allowing the expression of the lic2 gene are correlated with the expression of an 
epitope with the structure Galal-i4Gali91-*. Since the lic2 gene is homologous to 
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IgtB and IgtE the galactosyl transferases which link Gal/31-wj to respectively GIc o 
GIcNAc. it is likely that this is its function in Haemophilus irfiuenzae LOS 
synthesis. It is remarkable that while both these mucosal pathogens have evolved 
frame shift mechanisms to cause antigenic variation of die LOS, diat die 
5 gonococcal homologs of Uc2, (IgtB and IgtE) are not die ones diat contam poly-G 
tracts. 

While die frame-shift mechanisms discussed above are suited for on/off regulation 
of gene e^qirBssion, die structure of die locus also lends itself to more subde 
regulation of die level of expression of die genes. It has been demonstrated diat 

10 growdi rate affects die molecular weight distribution and andgenic character LOS 
species produced (Morse et a!., 1983, Infect. Immun. 41:74). While I have not 
determined die size of die RNA transcripts it is very likely diat IgtA, IgtB and IgtC 
(in die instance where die poly-G tracts are such diat die coding frame is 
maintained) are transcribed togedier. The termination codon of IgtA and die 

15 initiation codon of IgtB in fact overlap, and the distance between the TAA of igtB 
and die ATG of IgtC is only 1 1 bp. SimUariy. die stop codon of IgtD and the start 
codon of IgtE are separated by only 18 bp. Yet die organization is such diat if 
any of die tiiree genes subject to phase variation are in die off configuration, 
transcription is able to reinitiate effectively at die beginning of die next gene. This 

20 ability to reinitiate transcription was cleariy seen widi die mutations constructed in 
this study. 

The correlation of LOS structure widi function is still in its early stages. The 
major advances in die field have been die development of an understanding of die 
structure of die molecules and die ability to relate diis, often unambiguously, to 
25 die reactivity widi a number of well-characterized monoclonal antibodies. Added 
to diis is the realization diat in die in vivo environment, whidi provides CMP- 
NANA, die organism may or may not sialylate die LOS, depending whedier die 
LOS syndiesized is a competent accqitor structure. It is well known diat 
sialylation induces a serum-resistant state in many strains. However, die effect of 
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sialylation in local infection is not as well studied, van Putten has shown that 
sialylation of LOS has a maiked inhibitory effect on epithelial cell invasion, 
without ^arenUy greatly altmng adhesion (Van Putten, 1993. EMBO J. 
12:4043). His studies suggest that in the mucosal infection, LOS structures that 
5 cannot be sialylated may be niqiortant for efRcient cell invasion. In the context of 
this report, such structures could be achieved either by the efficient addition of the 
terminal GalNAc or by shorteniqg the LOS chain by silencipg the GIcNAc 
transferase. The correlation of LOS chranistry witfi biological reaction has been 
complicated by the lealdness of the existirig LOS mutants isolated by pyocm 

10 selection (Dudas and Apicella, 1988, Infect. Immun. 56:499; Sandlin et al., 1993, 
Infect, bnmun. 61:3360). This is m fact exenq)lified with mutant 1291e which 
shows in addition to the major low molecular weight band, an additional higher 
band (see Figure 7). The new insight provided into the genetics of the 
biosynthesis of gonococcal LOS will allow construction of mutants that are not 

15 leaky. For instance, A4 aiKi A5 should be stable mutants since they no longer 
contain genes with poly-G tracts. The expression of the genes containing the poly- 
G tracts could be stabilized by engineering the areas so that glycines arc encoded 
by other codons. 

The present invention is not to be limited hi scope by the specific embodiments 
20 described herein, since such embodiments are intended as but single illustrations of 
one aspect of the invention and any functionally equivalent embodunents are within 
the scope of this invention. Indeed, various modifications of the invention in 
addition to those shown ami described herein will become appareitt to those skilled 
in the art from the foregoing description and accompanying drawings. Such 
25 modifications are intended to fall within the scope of the appended claims. It is 
also to be undersUKxi that all base pair sizes given for nucleotides are approxmiate 
and are used for the purpose of description. Various references are cited herein, 
the disclosures of which are incorporated by reference herein in their entirety. 
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WHATISCI^TMKnr^. 

11. A purified nucleic acid that is hybridizable under moderately stringent 

2 conditions to a nucleic acid having a nucleotide sequence coirespondmg to or 

3 complonentaiy to tbe nucleotide sequence shown in Figure 2 (SEQ ID NO:l). 

1 2. "nKiwcleic acid of Oaiml that is Iqrbridizable under 

2 conditions to a nucleic acid having a nucleotide sequence corresponding to or 

3 complementaiy to a portion of the nucleotide sequence shown in Figure 2 (SEQ ID 

4 NO: 1) diat encodes a fiinctionally active glycosyltransferase. 

13. The nucleic acid of Claim 2 that enco^ a iimctionaUy active 
2 glycosyltransfoase. 

14. The nucleic acid of Claim 1 that has a nucleotide sequence corresponding 

2 to or complementary to a portion of the nucleotide sequence shown in Figure 2 

3 (SEQ ID NO: 1) that encodes a functionally active glycosyltransferase. 

15. The nucleic acid of Claim 4 that encodes a functionally active 
2 glycosyltransferase. 



1 



6. The nucleic acid of Claim 1 that has a nucleotide sequence corresponding 
to or conq>leinentaTy to the nucleotide sequence shown in Figure 2 (SEQ ID 
NO:l). 



1 7. The nucleic acid of Claim 3, wherein the functionally active 

2 glycosyltransferase catalyzes a reaction selected from the group consisting of: 

3 a) adding Gal 01-*4 to GlcNAc or Glc; 

4 b) addiog GalNAc or GIcNAc /?l-3 to Gal; and 

5 c) adding Gal al-»4 to Gal. 
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1 8. The nucleic acid of Claim 3 which encodes a glycosyltransferase having an 

2 amino acid sequence of SEQ ID N0:3. 

1 9. The nucleic acid of Claim 3 which encodes a glycosyltransferase having an 

2 amino acid sequence of SEQ ID NO:8. 

1 10. The nucleic acid of Claim 3 which encodes a glycosyltransferase having an 

2 amino acid sequence of SEQ ID N0:4. 

1 11 . The nucleic acid of Clahn 3 which encodes a glycosyltransferase having an 

2 amino acid sequence of SEQ ID NO:S. 

1 12. The nucleic acid of Claim 3 which encodes a glycosyltransferase having an 

2 amino acid sequence of SEQ ID NO:6. 

1 13. An expression vector con^>rising the nucleic acid of Claim 3 operatively 

2 associated with an expression control sequence. 

1 14. A recombinant host cell transformed with the expression vector of Claim 

2 13. . 

A method for producing a glycosyltransferase comprising: 

a) culmring the recombinant host cell of Claim 14 under conditions 
that allow expression of the glycosyltransferase; and 

b) recovering the expressed glycosyltransferase. 

1 16. A glycosyltransferase having an amino acid sequence of SEQ ID NO:3, or 

2 a functionally active fragment thereof. 

1 17. A glycosyltransferase having an amino acid sequence of SEQ ID N0:8, or 

2 a functionally active fragment thereof. 



1 15. 
2 
3 
4 
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118. A glycosyltransferase having an amino acid sequence of SEQ ID NO:4, or 
2 a functionally active fragment thereof. 

1 19. A glycosyltransferase having an aniino acid sequence of SEQ ID N0:5, or 

2 a functionally active fragment thereof. 

1 20. A glycosyltransferase having an amino acid sequence of SEQ ID N0:6, or 

2 a functionally active fragment diereof . 

1 21. A composition comprising a glycosyltransferase conjugated to a solid phase 

2 support, wherein the glycosyltransferase is selected from the group consisting of: 



3 a) a glycosyltransferase having an amino acid sequence of SEQ ID 

4 , NO: 3, or a functionally active fragment thereof; 

5 b) a glycosyltransferase having an amino acid sequence of SEQ ID 

6 NO:8, or a iimctionally active fragment thereof; 

7 c) a glycosyltransferase having an amino acid sequence of SEQ ID 

8 NO:4, or a functionally active fragment thereof; 

9 d) a glycosyltransferase having an amino acid sequence of SEQ ID 

10 NO:S, or a functionally active fragment thereof; and 

11 e) a glycosyltransferase having an amino acid sequence of SEQ ID 

12 NO:6, or a functionally active fragment thereof. 



1 22. A method for adding GalNAc or GlcNAc |31-*3 to Gal, comprising 

2 contacting a reaction mixture comprising an activated GalNAc or GlcNAc to an 

3 acceptor moiety comprising a Gal residue in die presence of the 

4 glycosyltransferase of Claim 16. 

1 23. A method for adding Gal /31->4 to GlcNAc or GIc, comprising contacting a 

2 reaction mixture conq>risiiig an activated Gal to an acceptor moiety comprising a 

3 GlcNAc or Glc residue in the presence of the glycosyltransferase of Claim 17. 
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1 24, A method for adding Gal al-^ to Gal, comprising contacting a reaction 

2 mixture comprising an activated Gal to an acceptor moiety comprising a Gal 

3 residue in the presence of the glycosyltransferase of Claim 18. 

1 25, A method for adding GalNAc or GlcNAc /31-3 to Gal, comprising 

2 contacting a reaction mixture comprising an activated GalNAc or GlcNAc to an 

3 acceptor moiety comprising a Gal residue in die presence of the 

4 glycosyltransferase of Claim 19. 

1 26. A method for adding Gal 01^*4 to GlcNAc or Glc, comprising contacting a 

2 reaction mixture conq)risiiig an activated Gal to an acceptor moie^ comprising a 

3 GlcNAc or Glc residue in the presence of the glycosyltnuisferase of Claim 20. 

1 27. A method for prqiarii^ an oligosaccharide having the strucmre 

2 GaIal-4Gal/?l-^lc, which comprises sequentially performing the steps of: 

3 a) contacting a reaction mixture comprising an activated Gal to an 

4 acceptor moiety comprising a Glc residue in the presence of a 

5 glycosyltransferase having an amino acid sequence of SEQ ID NO:6, or a 

6 functionally active fragment thereof; and 

7 b) contacting a reaction mixture comprising an activated Gal to the 

8 acceptor moiety comprising GaIi31-«4Glc in the presence of a 

9 glycosyltransferase having an amino acid sequence of SEQ ID N0:4, or a 
10 functionally active fragment thereof. 



1 28, A method for preparing an oligosaccharide having the structure 

2 Gal/31-i4Glc, which comprises contacting a reaction mixture comprising an 

3 activated Gal to an acceptor moiety conqirisii^ a Glc residue in the presence of 

4 the glycosyltransferase of Claim 20. 

1 29. A method for preparii^ an oligosaccharide havmg the structure 

2 GlcNAc/Sl— 3Galj81-*4Glc, which comprises contacting a reaction mixture 
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3 coiiq>rising an activated GlcNAc to an acceptor moiety con^nising a Gali31-^lc 

4 residue in the presence of die glycosyltransferase of Claim 16. 

1 30. A method for preparing an oligosaccharide having the structure 

2 Galtf l-^lcNAc/Jl-^3Gal/Sl-^lc, which comprises contacting a reaction mixture 

3 comprising an activated Gal to an acceptor moiety comprising a 

4 GlcNAc^l-^3Gali81-i4GIc residue in die presence of die glycosyltransferase of 

5 Claim 17. 

1 31. A mediod for preparing an oligosaccharide having die structure 

2 GalNAcj81-^3GaliJl-^IcNAc^l-^3Galj81--4Glc, which comprises contacting a 

3 reaction nuxture conaprising an activated GalNAc to an acceptor moiety 

4 comprising a Gali81-i4GlcNAcj31-^3Gal<31-^lc residue in die presence of die 

5 glycosyltransferase of Claim 19. 



1 32. A method for preparing an oligosaccharide having the structure 

2 GalNAc/31-3Gal/Jl-^lcNAc01-.^3Gal/91-MGlc, which comprises sequentially 

3 perfonning the steps of: 

4 a) contacting a reaction mixture comprising an activated Gal to an 

5 acceptor moiety comprisuig a Glc residue in the presence of a 

6 glycosyltransferase havmg an amino acid sequence of SEQ ID NO: 6, or a 

7 functionally acdve fragment thereof; 

8 b) contacting a reaction mixture comprismg an activated GlcNAc to the 

9 acceptor moiety comprising a Gal/31-HNM3lc residue in the presence of a 

10 glycosyltransferase having an amino acid sequence of SEQ ID N0:3, or a 

11 functionally active fragment thereof ; 

12 c) contactiiig a reaction mixture comprismg an activated Gal to tte 

13 acceptor moiety comprising a GlcNAcpi-i^3Gal/51-*4Glc residue in the 

14 presence of a glycosyltransferase having an amino acid of SEQ ID NO:8; 

15 and 
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1^ comacting a reaction mixture comprising an activated GalNAc to the 

17 acceptor moiety comprising a GaI/31-^GlcNAc/Jl-*3Gal/31-^Ic i«sidue in 

18 the presence of a glycosyltransferase having an amino acid sequence of 

19 SEQ ID NO: 5, or a functionally active fragment thereof. 

1 33. A method for prq)aring an oligosaccharide having the structure 

2 Gaipi-^lcNAc^l-^3Galj81-^lc, which comprises sequentially performing the 

3 sieps of: 

4 a) contacting a reaction mixture comprising an activated Gal to an 

5 acceptor moiety comprismg a Glc residue in the presence of a 

6 glycosyltransfmse having an amino acid sequence of SEQ ID NO: 6, or a 

7 functionally active fragment thereof; 

8 b) contacting a reaction mixture comprising an activated GlcNAc to die 

9 acceptor moiety comprising a Gal)31-^Glc residue in the presence of a 

10 glycosyltransferase having an amino acid sequence of SEQ ID NO:3, or a 

11 functionally active fragment thereof; and 

^2 c) contacting a reaction mixture comprising an activated Gal to the 

13 acceptor moiety comprising a GIcNAc/31-*3Galj81-»4Glc residue in the 

14 presence of a glycosyltransferase having an amino acid of SEQ ID NO:8. 
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